Methods for genetic control of plant pest infestation and compositions thereof

ABSTRACT

The present invention is directed to controlling plant pest infestation, and particularly plant nematode infestation, by inhibiting one or more biological functions in the plant pest. The invention discloses methods and compositions for use in controlling plant pest infestation by providing one or more different recombinant double stranded RNA molecules in the diet of the pest in order to achieve a reduction in pest infestation through suppression of pest gene expression. The invention is also directed to methods for making transgenic plants that express the double stranded RNA molecules, to methods for detecting cells comprising the disclosed sequences, and to methods for detecting the disclosed sequences in biological samples.

CROSS REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of priority to U.S. ProvisionalPatent Application Ser. No. 60/655,875 filed Feb. 24, 2005, and thesequence listing filed along with that application, which isincorporated herein by reference in its entirety.

FIELD OF THE INVENTION

The present invention relates generally to the field of molecularbiology and more specifically to the genetic control of plant pests, andeven more particularly to the genetic control of Heterodera nematodeinfestations in plants. More specifically, the present invention relatesto methods for modifying expression of one or more polynucleotide and/orprotein molecules in one or more cells or tissues of a pest species. Thepresent invention discloses substantially the entire genome sequence ofthe plant nematode pest, Heterodera glycines, and describes the use ofthese sequences to modify the expression of one or more targetpolynucleotide or protein molecules in at least the cells of aHeterodera species by providing in its diet a dsRNA that comprises apart of, or all, or substantially all of one or more polynucleotidemolecules of the present invention.

BACKGROUND OF THE INVENTION

Plants and animals are targets of many different pests, including butnot limited to nematode and insect pest species. Crops are often thetargets of nematode infestations. Chemical nematicides are not effectivein eradicating the nematode infestations. Chemical pesticidal agents arenot selective and exert their effects on non-target fauna as well, ofteneffectively sterilizing for a period of time a field over which thechemical nematicidal agents have been applied. Some chemical pesticidalagents have been shown to accumulate in food, and to exhibit adverseeffects on workers that manufacture and apply such chemical agents. Thusthere has been a long felt need for methods for controlling oreradicating nematode pest infestation on or in plants, i.e., methodswhich are selective, environmentally inert, non-persistent,biodegradable, and that fit well into pest resistance managementschemes. Plant biotechnology provides a means to control pestinfestations by providing plants that express one or more pest controlagents. Recombinant pest control agents have generally been reported tobe proteins selectively toxic to a target pest that are expressed by thecells of a recombinant plant. Recently, small RNA molecules provided inthe diet of the pest species Meloidogyne incognita have been shown toexhibit effects on the viability of the pest by affecting geneexpression in the pest cells (Tobias et al. WO 01/37654 A2). Recombinantapproaches to plant pest control can be selective, and areenvironmentally inert and non-persistent because they are fullybiodegradable.

The phenomenon of dsRNA mediated gene silencing has been demonstrated ina number of plant and animal systems (Fire et al. 1998 Nature391:806-811; Waterhouse et al. 1998 PNAS USA 95:13959-13964; Tabara etal. 1998 Science 282:430-431; Fire et al. WO 99/32619 A1; Trick et al.WO 2004/005485 A2). Methods for delivering dsRNA into the animal systemsinvolved generating transgenic insects that express double stranded RNAmolecules or injecting dsRNA solutions into the body of the animal orwithin the egg sac prior to or during embryonic development. Doublestranded RNA mediated gene suppression has been demonstrated in plantparasitic nematodes either by providing dsRNA or miRNA's in thenematodes' diet or by soaking the nematodes in solutions containing suchRNA molecules (Atkinson et al., (The University of Leeds) WO 03/052110A2; Trick et al., (Kansas State University Research Foundation) US2004-009876A1). Cyst nematodes (Heterodera and Globodera species) areparticularly damaging pests of crop plants. Cyst nematodes include butare not limited to Heterodera avenae, H. cruciferae, H. glycines, H.hordecalis, H. latipons, H. oryzae, H. oryzicola, H. rostochinesis, H.zeae, H. schachtii, G. achilleae, G. artemisiae, G. mexicana, G.millefolii, G. pallida, G. rostochiensis, G. tabacum, G. tabacumsolanacearum, G. tabacum tabacum, G. tabacum virginiae, Globodera sp.Bouro, Globodera sp. Canha, Globodera sp. Ladeiro, Globodera sp. NewZealand-EK-2004, and Globodera sp. Peru-EK-2004. These species are knownto parasitize a wide variety of crops including, but not limited tobarley, corn, oats, rice, rye, wheat, cabbage, cauliflower, soybean,sugar beet, spinach, mustards, and potato. Cyst nematodes areparticularly problematic. Eggs persist and remain viable in the soil formany years. Genetic resistance by conventional crop breeding has limitedsuccess in identifying resistance genes effective against the widevariety of races and biotypes of the cyst nematodes. Of particularconcern is the soybean cyst nematode, Heterodera glycines, hereinreferred to as SCN.

Therefore, there exists a need for improved methods and compositionsuseful to modulate gene expression by repressing, delaying or otherwisereducing gene expression within a particular plant nematode pest for thepurpose of controlling the nematode infestation or to introduce novelagronomically valuable phenotypic traits.

SUMMARY OF THE INVENTION

The present invention comprises methods and compositions for inhibitingexpression of one or more target genes and proteins at least in cystnematodes such as members of the Heterodera and Globodera species. Morespecifically, the present invention comprises a method of modulating orinhibiting expression of one or more target genes in Heterodera glycinesand Heterodera schactii, to cause cessation of feeding, growth,development, reproduction and infectivity and eventually result in thedeath of the nematode pest. The method comprises introduction of partialor fully, stabilized double-stranded RNA (dsRNA) or its modified forms,such as small interfering RNA (siRNA) or micro RNA (miRNA) sequences,into the cells of the nematode wherein expression of at least one ormore target genes is inhibited in the target nematode pest, whereininhibition of the one or more target genes exerts a deleterious effectupon the nematode pest, wherein the dsRNA, siRNA, or miRNA are derivedfrom the target gene sequences, specifically inhibit such target genesin the target pest, and so are specific to nematode pests such as thoseof the Heterodera species. It is specifically contemplated that themethods and compositions of the present invention will be useful inlimiting or eliminating Heterodera infestation in or on any cystnematode host by providing one or more compositions comprising dsRNAmolecules in the diet of the nematode, wherein the diet is all or partof a plant cell.

In another aspect of the present invention, DNA molecules of the presentinvention comprise molecules that function as promoter sequences,polypeptide coding sequences, non-coding regulatory sequences, orpolyadenylation sequences isolated from the genome of the soybean cystnematode, the polynucleotide sequence of which is at least from about75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92,93, 94, 95, 96, 97, 98, 99, or about 100% identical to sequencesselected from the group consisting of SEQ ID NO: 1 through SEQ IDNO:45568, the complement thereof, or a portion thereof. A DNA moleculeselected from the group consisting of SEQ ID NO:97730 through SEQ IDNO:119145 exhibits promoter activity, and a DNA molecule selected fromthe group consisting of SEQ ID NO:45569 through SEQ ID NO:47643comprises at least one protein coding sequence, whether or not acomplete open reading frame is exhibited.

Accordingly, in another aspect of the present invention, a set ofisolated and purified polynucleotide sequences as set forth in SEQ IDNO:45569 through SEQ ID NO:47643 are provided as target sequences forthe design of DNA constructs that express a stabilized dsRNA, siRNA, ormiRNA molecule for inhibition of expression of a target gene in anematode pest. A stabilized dsRNA, siRNA, or miRNA molecule can comprisetwo or more polynucleotide molecules that are arranged in a sense and anantisense orientation relative to at least one promoter, wherein thepolynucleotide molecule that comprises a sense strand and an antisensestrand are linked or connected by a spacer sequence of at least fromabout five to about one thousand nucleotides, wherein the sense strandand the antisense strand are at least about the same length, and whereineach of the two polynucleotide sequences shares at least about 80%sequence identity, at least about 90%, at least about 95%, at leastabout 98%, or even about 100% sequence identity, to a polynucleotidesequence as set forth in one of SEQ ID NO:45569 through SEQ ID NO:47643.

The present invention provides a method for identifying a DNA moleculefor use as a DNA construct expressing a dsRNA-mediated gene silencingsequence in a plant cell, comprising selecting a target polynucleotidemolecule of a Heterodera glycines polynucleotide sequence comprising 21or more contiguous nucleotides wherein said polynucleotide sequence isselected from the group consisting of SEQ ID NO:45569-50775, SEQ IDNO:45569-47643, and SEQ ID NO:47644-50775.

The present invention also provides a recombinant DNA molecule for usein plant transformation, constructed to contain at least onepolynucleotide molecule transcribed as a single stranded RNA molecule.The single stranded RNA molecule is capable of forming in vivo a doublestranded RNA molecule through intermolecular hybridization that, whenprovided in the diet of a nematode pest, inhibits the expression of atleast one target gene in one or more cells of the target organism. Thepolynucleotide molecule is operably linked to at least one promotersequence that functions in a transgenic plant cell to transcribe thepolynucleotide molecule into one or more ribonucleic acid molecules. TheRNA molecule(s) self assemble into double stranded RNA molecules and areprovided in the diet of a target pest that feeds upon the transgenicplant. The provision of the dsRNA molecule in the diet of the pestachieves the desired inhibition of expression of one or more targetgenes within the pest organism, resulting in fecundicity, morbidity,and/or mortality of the target pest.

The present invention also provides a recombinant plant cell having inits genome at least one recombinant DNA sequence that is transcribed toproduce at least one dsRNA molecule that functions when the cell and/orits contents are ingested by a target nematode or pest to inhibit theexpression of at least one target gene in the target nematode or pest.The dsRNA molecule is transcribed from all or a portion of apolynucleotide molecule that at least in part exhibits from about 75 toabout 100% identity to a target nematode specific polynucleotidesequence as set forth in SEQ ID NO:45569 through SEQ ID NO:50775.

The present invention also provides a recombinant DNA construct forexpression of a dsRNA-mediated gene silencing sequence in a plant cell,which comprises at least two different target sequences that, whenexpressed in vivo as RNA sequences and provided in the diet of a targetnematode pest, inhibit the expression of at least two different targetgenes in one or more cells or tissues of the target nematode pest. Afirst target sequence exhibits at least from about 75 to about 100percent identity to a first specific polynucleotide sequence region asset forth in SEQ ID NO:45569-SEQ ID NO:50775, and a second targetsequence that is different from the first target sequence exhibits atleast about 75 to about 100 percent identity to a second specificpolynucleotide sequence region as set forth in SEQ ID NO:45569 throughSEQ ID NO:50775, wherein the two or more target sequences are assembledin a DNA construct and expressed together as a single RNA transcript,and are constructed to form one or more dsRNA's useful in suppression ofthe one or more target genes. The DNA construct is transformed into asoybean cell, and the cell is regenerated into a recombinant plant. ThedsRNA molecules are thus provided in the diet of the target nematodepest in a target nematode pest inhibitory concentration. Ingestion bythe target nematode pest of recombinant plant cells or tissuesexpressing the recombinant dsRNA achieves the desired inhibition ofexpression of one or more target genes in the nematode, resulting in thefecundicity, morbidity, and/or mortality of the target nematode pest.

Another aspect of the present invention is the use of the nucleotidesequences disclosed herein to identify target sequences that occur inthe transcript RNA of other plant pests, in particular insect pests,fungal pests, and nematode pests, that could be targeted simultaneouslyand/or contemporaneously with a single expression construct designed tosuppress related genes in multiple plant pests by identifying sequencesof sufficient length and identity in two or more different plant pests,and ensuring that the DNA construct used to produce a recombinant plantexpresses one or more dsRNA molecules that function to effectivelysuppress one or more related genes in the two or more different plantpests. In particular, some contiguous nucleotide sequences equal to orgreater than about 21-24 nucleotides in length are identified herein tobe present within the genome of the soybean cyst nematode (SCN)Heterodera glycines and have been identified to be present as well inHeterodera schactii, and to some extent, other pest species as well,such as in several other nematode pest species as well as in other plantpest species such as specific insect nucleotide sequences, and in animalpest species such as insect pest species. Such sequences may be usefulfor effectively suppressing a target sequence in these plant pests,particularly when expressed as a dsRNA molecule in a recombinant plantcell that is provided in the diet of the pest, and can provideresistance to the plant from pest infestation from all or substantiallyall pests in which such sequences appear, in particular if the sequencesin common are within genes shown to be essential for survival,reproduction, mobility, and/or development and differentiation.

The target sequences disclosed in the present invention can be used toidentify related target sequences that occur in the transcript RNA ofother pest species, particularly nematode species including but notlimited to pests such as Heterodera species such as H. avenae, H.ciceri, H. crucifera, H. cyperi, H. fici, H. goettingiana, H.hordecalis, H. humuli, H. latipons, H. litoralis, H. medicaginis, H.mediterranea, H. oryzae, H. oryzicola, H. riparia, H. rostochinesis, H.salixophila, H. schachtii, H. sorghi, H. trifolii, H. turcomanica, andH. zeae, Meloidogyne species such as M. arenaria, M. chitwoodi, M.artiellia, M. fallax, M. hapla, M. javanica, M. incognita, M. microtyla,M. partityla, M. panyuensis, and M. paranaensis, Globodera species suchas G. pallida, G. rostochiensis, and G. tabacum, Pratylenchus speciessuch as P. brachyrus, P. crenatus, P. coffeae, P. magnica, P. neglectu,P. penetrans, P. scribneri, P. thornei, and P. vulnus. Other plant pestnematode species that are within the scope of the present inventioninclude but are not limited to Xiphinema species, Nacobbus species,Hoplolaimus species, Paratylenchus species, Rotylenchulus species,Criconemella species, Hemicycliophora species, Helicotylenchus species,Rotylenchus species, Belonolaimus species, Trichodorus species,Tylenchorhynchus species, Radopholus species, Longidorus species,Dolichodorus species, Aphenlenchoides species, Ditylenchus species,Anguina species, and Tylenchulus species. A DNA construct that expressesa dsRNA molecule in a plant cell that has a target sequence common tomultiple plant pests provides plant resistance to pest infestation fromeach pest containing such target sequences. A particular target sequencecan be amplified within a single dsRNA transcript, and can contain onlya single contiguous sequence of at least from about 17 to about 21 toabout 50 nucleotides in common between any combination of pests, or canbe comprised of a chimera consisting of various contiguous sequences ofat least from about 17 to about 21 to about 50 or more nucleotides, eachsuch contiguous sequence either being in common between two or morepests, or unique to only a single pest, such that the chimera, whenpresent as a dsRNA sequence and provided in the diet of any one or moreof the targeted pests, results in the effective control such one or morepests.

The present invention also provides a method for producing a transgenicplant by introducing into the genome of the plants' cells apolynucleotide sequence consisting of all or a portion of at least oneof the aforementioned SCN specific recombinant DNA sequences, linked tolinked substantially the complement of that sequence. Transgenic plantsare generated from the transformed plant cell, and progeny plants,seeds, and plant products, each comprising the polynucleotide sequence,are produced from the transgenic plants.

The methods and compositions of the present invention may be applied toany monocot and dicot plant, depending on the pest species to becontrolled and the host range of the nematode pest. Specifically, theplants are intended to comprise without limitation alfalfa, aneth,apple, apricot, artichoke, arugula, asparagus, avocado, banana, barley,beans, beet, blackberry, blueberry, broccoli, brussel sprouts, cabbage,canola, cantaloupe, carrot, cassaya, cauliflower, celery, cherry,cilantro, citrus, clementine, coffee, corn, cotton, cucumber, Douglasfir, eggplant, endive, escarole, eucalyptus, fennel, figs, gourd, grape,grapefruit, honey dew, jicama, kiwifruit, lettuce, leeks, lemon, lime,Loblolly pine, mango, melon, mushroom, nut, oat, okra, onion, orange, anornamental plant, papaya, parsley, pea, peach, peanut, pear, pepper,persimmon, pine, pineapple, plantain, plum, pomegranate, poplar, potato,pumpkin, quince, radiata pine, radicchio, radish, raspberry, rice, rye,sorghum, Southern pine, soybean, spinach, squash, strawberry, sugarbeet, sugarcane, sunflower, sweet potato, sweetgum, tangerine, tea,tobacco, tomato, turf, a vine, watermelon, wheat, yams, and zucchiniplants. Preferably, the present invention is related to a transgenicsoybean plant that contains in its genome a DNA construct that expressesa dsRNA molecule from any sequence of the present invention.

The invention also provides a computer readable medium having recordedthereon one or more of the sequences as set forth in SEQ ID NO:1 throughSEQ ID NO:171306 and, with reference to nucleotide sequences, thecomplements thereof, for use in a number of computer based applications,including but not limited to DNA identity and similarity searching,protein identity and similarity searching, transcription profilingcharacterizations, comparisons between genomes, and artificialhybridization analyses.

BRIEF DESCRIPTION OF THE SEQUENCES

SEQ ID NO:1-SEQ ID NO:45568 correspond to individual sequences(singletons) and assembled singletons forming contiguous overlappingsequences (contigs) derived from DNA sequence analysis of one or morelibraries produced from the genome of the soybean cyst nematode strainOP25.

SEQ ID NO:45569-SEQ ID NO:97729 correspond to sequences predicted toencode various proteins, tRNA's, rRNA's and the like, which wereidentified using the bioinformatics described herein as applied to SEQID NO:1-SEQ ID NO:45568, and are further defined in blocks of sequencescorresponding to coding sequences characterized as (a) essential to SCNsurvival (SEQ ID NO:45569-SEQ ID NO:50775) and (b) other codingsequences and elements (SEQ ID NO:50776-SEQ ID NO:97729); and where theessential sequences are further defined in blocks of sequencescorresponding to unigenes, EST's, or cDNA's which were (c) linkedthrough bioinformatics analyses described herein to counterpartsequences entirely or partially known in the art (SEQ ID NO:47644-SEQ IDNO:50775) and (d) unique sequences exhibiting no known relationship tosequences known in the art (SEQ ID NO:45569-47643).

SEQ ID NO:97730-SEQ ID NO:119145 correspond to sequences predicted tocomprise all or substantially all of one or more SCN promoter sequences.

SEQ ID NO:119146-SEQ ID NO:124352 correspond to amino acid sequencespredicted to be encoded from the (a) essential and (b) other codingsequences set forth in SEQ ID NO:45569-SEQ ID NO:97729, and are furtherdefined in blocks of sequences corresponding to (c) peptides essentialto SCN survival, as set forth in SEQ ID NO:121221-SEQ ID NO:124352, eachbased on one or more BLASTP relationship to one or more proteins knownto be essential to survival of C. elegans or other organisms (translatedfrom SEQ ID NO:47644-SEQ ID NO:50775), and (d) other peptides lackingany BLASTP relationship to proteins known in the art, as set forth inSEQ ID NO:119146-SEQ ID NO:121220 (translated from SEQ ID NO:45569-SEQID NO:47643).

DETAILED DESCRIPTION OF THE INVENTION

The following is a detailed description of the invention provided to aidthose skilled in the art in practicing the present invention. Those ofordinary skill in the art may make modifications and variations in theembodiments described herein without departing from the spirit or scopeof the present invention.

The inventors have discovered all or substantially all of thepolynucleotide sequences that comprise the genomic DNA obtained from thesoybean cyst nematode Heterodera glycines, aligned the sequences toderived large blocks of sequence corresponding to genomic contigs setforth herein, analyzed these contigs to identify and characterizeuntranslated regulatory sequences, for example, promoters, introns,transcriptional initiation sequences, and polyadenylation signals.Genomic polynucleic acid sequences encoding all or part of one or moreproteins and characterized as being essential for survival, such asamino acid sequences involved in various metabolic or catabolicbiochemical pathways, cell division, reproduction, energy metabolism,digestion, neurological function and the like, are identified in thegenomic DNA sequences disclosed in the present invention, and regions ofsuch sequences are demonstrated herein as being useful for selection foruse in preparing DNA constructs for use in transforming cells, and thatalso express double stranded RNA molecules from such constructs in thetransformed cells, and when provided in the diet of a target pest,whether artificial diet or a natural diet, especially a plant cell,plant tissues or other plant parts, such as leaves, roots, stems,flowers, fruits or seeds, results in the fecundicity, morbidity, and/ormortality of the pest.

As described herein, ingestion by a target nematode pest of compositionscontaining one or more dsRNA molecules, wherein at least one segment ofthe dsRNA molecule corresponds to a substantially identical segment ofRNA produced in the cells of the nematode, will result in death, growthinhibition, stunting, inhibition of maturation or fecundity of thenematode. These results indicate that a polynucleotide molecule, eitherDNA or RNA, derived from SCN can be used to design a DNA constructaccording to the methods of the present invention to express arecombinant gene product in a transgenic host cell. The host cell can betransformed to contain one or more of the polynucleotide moleculesderived from sequences disclosed herein. The DNA construct transformedinto the host cell transcribes one or more RNA sequences that form intoa dsRNA molecule in the cell or biological fluids within the transformedhost, thus making the dsRNA available for ingestion by nematode when itfeeds upon the transgenic host. Therefore, the transformed host cell nowcontains within its genome the genetic potential to defend the host celland its parents, siblings and children from attack by the nematode.

The present invention relates to genetic control of nematodeinfestations in host organisms. More particularly, the present inventionincludes the DNA constructs, selection of target polynucleotides andmethods for delivery of nematode control agents to a nematode. Thepresent invention provides methods for employing stabilized dsRNAmolecules in the diet of the nematode as a means for suppression oftargeted genes in the nematode, thus achieving desired control ofnematode infestations in, or about the host or symbiont targeted by thenematode. The preferred host is a plant wherein the plant is transformedwith a recombinant DNA construct that expresses recombinant stabilizeddsRNA, siRNA, and/or miRNA molecules. The recombinant DNA constructcomprises a nucleotide sequence that is transcribed into RNA by the hostcell. The term “recombinant DNA” or “recombinant nucleotide sequence”refers to DNA that contains a genetically engineered modificationthrough manipulation as a result of methods for mutagenesis, use ofrestriction enzymes, and thermal amplification methods, and the like.

The dsRNA molecules, siRNA molecules, and/or miRNA molecules of thepresent invention are homologous or complementary to at least about acontiguous 17-21 nucleotide sequence selected from the group consistingof SEQ ID NO:45569 through SEQ ID NO:97729. Isolated and purifiednucleotide sequences from a SCN are provided from a genomic libraryconstructed from polynucleotide sequences of the pest. The ingestion ofsuch nucleotide sequences results in the reduction or elimination of anessential gene product necessary for the nematode's growth anddevelopment or other biological function.

The present invention also contemplates a transformed plant cell andtransformed plants and their progeny. The transformed plant cells andtransformed plants express one or more of the dsRNA, siRNA, or miRNAsequences of the present invention from one or more of the DNA sequencesas set forth in SEQ ID NO:1-SEQ ID NO:45568 and SEQ ID NO:45569 throughSEQ ID NO:97729, or the complement thereof.

As used herein the words “gene suppression”, when taken together, areintended to refer to any method for reducing the levels of a geneproduct as a result of gene transcription to mRNA. Gene suppression isalso intended to mean the reduction of protein expression from a gene ora coding sequence including posttranscriptional gene suppression andtranscriptional suppression. Posttranscriptional gene suppression isintended to refer to that suppression mediated by the homology betweenall or a part of an RNA transcript transcribed from a gene or codingsequence targeted for suppression and the corresponding double strandedRNA used for suppression, and refers to the substantial and measurablereduction of the amount of available mRNA available in the cell forbinding by ribosomes. The transcribed RNA can be in the senseorientation to effect what is referred to as co-suppression, in theanti-sense orientation to effect what is referred to as anti-sensesuppression, or in both orientations producing a dsRNA to effect what isreferred to as RNA interference (RNAi). Transcriptional suppression isintended to refer to that suppression mediated by the presence in thecell of a dsRNA, a gene suppression agent, exhibiting substantialsequence identity to a promoter DNA sequence or the complement thereofto effect what is referred to as promoter trans suppression. Genesuppression may be effective against a native plant gene associated witha trait, e.g., to provide plants with reduced levels of a proteinencoded by the native gene or with enhanced or reduced levels of anaffected metabolite. Gene suppression can also be effective againsttarget genes in plant nematodes that may ingest or contact plantmaterial containing gene suppression agents, specifically designed toinhibit or suppress the expression of one or more homologous orcomplementary sequences in the cells of a nematode or other pest.

Post-transcriptional gene suppression by anti-sense or sense orientedRNA to regulate gene expression in plant cells is disclosed in U.S. Pat.Nos. 5,107,065, 5,759,829, 5,283,184, and 5,231,020. The use of dsRNA tosuppress genes in plants is disclosed in WO 99/53050, WO 99/49029, U.S.Patent Application Publication 2003/0175965 A1, and 2003/0061626 A1,U.S. patent application Ser. No. 10/465,800, and U.S. Pat. Nos.6,506,559, and 6,326,193.

A preferred method of post transcriptional gene suppression in plantsemploys both sense-oriented and anti-sense-oriented, transcribed RNA,which is stabilized, e.g., as a hairpin and stem and loop structure. Apreferred DNA construct for effecting post-transcriptional genesuppression is one in which a first segment transcribes an RNA moleculein an anti-sense orientation relative to the mRNA of the gene transcripttargeted for suppression, the first segment further linked to a secondsegment spacer region that is not homologous or complementary to thefirst segment, and linked to a third segment that transcribes an RNA,wherein a portion is substantially complementarity to the first segment.Such a construct would be expected to form a stem and loop structure byhybridization of the first segment with the third segment and a loopstructure forms comprising the second segment (WO94/01550, WO98/05770,US 2002/0048814A1, and US 2003/0018993A1).

As used herein, the term “nucleic acid”, “polynucleic acid”, or“polynucleotide” refers to a single or double-stranded polymer ofdeoxyribonucleotide or ribonucleotide bases (also referred to asnucleotides) read from the 5′ to the 3′ end. The polynucleic acid mayoptionally contain non-naturally occurring or altered nucleotide basesthat permit correct read through by a polymerase. The term “nucleotidesequence” or “polynucleic acid sequence” may refer to both the sense andantisense strands of a polynucleic acid molecule as either individualsingle strands or in the duplex. The term “ribonucleic acid” (RNA) isinclusive of RNAi (inhibitory RNA), dsRNA (double stranded RNA), siRNA(small interfering RNA), mRNA (messenger RNA), miRNA (micro-RNA), tRNA(transfer RNA, whether charged or discharged with a correspondingacylated amino acid), and cRNA (complementary RNA) and the term“deoxyribonucleic acid” (DNA) is inclusive of cDNA and genomic DNA andDNA-RNA hybrids. The words “nucleic acid segment”, “nucleotide sequencesegment”, or more generally “segment” will be understood by those in theart as a functional term that includes genomic sequences, ribosomal RNAsequences, transfer RNA sequences, messenger RNA sequences, operonsequences and smaller engineered nucleotide sequences or portionsthereof that control or affect the expression of a gene product or thatmay be adapted to express proteins, polypeptides or peptides. Apolynucleic acid may optionally contain naturally occurring or alterednucleotide bases that prevent polymerization by a first polymerasecopying the strand that contains such base(s), i.e., one or more basesthat cannot be templated by the first polymerase while polymerizing thenascent or growing strand, so that any nucleotide sequence extendingbeyond the non-templated base(s) results in a cohesive end that can beused to link the polynucleic acid to one or more other nucleic acidsequences linked to the complement of the cohesive end, resulting in achimeric nucleotide sequence. The naturally occurring or alterednucleotide base(s) can then be templated to link the fragmentscomprising the chimeric nucleotide sequence by exposing the chimera to asecond polymerase that recognizes the naturally occurring or alterednucleotide base(s) and copies that/those base(s) with fidelity (Jarrellet al. U.S. Pat. No. 6,358,712; Newton et al. 1993 21:1155-1162). Thismethod may be particularly useful when assembling multi-componentsequences for expression of an RNA sequence that folds into a dsRNAsequence and functions to suppress one or more genes in one or moretarget organisms.

As used herein, the term “nematode” refers to plant parasitic nematodes,in particular to members of the Tylenchoidea superfamily, and morespecifically to the Heteroderidae family of nematodes that include thecyst nematodes (including at least Heterodera and Globodera species) andthe rootknot nematodes (Meloidogyne species). More specifically toHeterodera species and even more specifically to Heterodera glycines,the soybean cyst nematode. Nematode species that were shown to havehomologous target sequences with H. glycines polynucleotides of thepresent invention were: rootknot nematode species—Meloidogyne speciessuch as M. arenaria, M. chitwoodi, M. artiellia, M. fallax, M. hapla, M.javanica, M. incognita, M. microtyla, M. partityla, M. panyuensis, andM. paranaensis; cyst nematode species—Heterodera species such as H.schachtii, Globodera species such as G. rostochiensis, G. pallida, andG. tabacum, Heterodera species such as H. trifolii, H. medicaginis, H.ciceri, H. mediterranea, H. cyperi, H. salixophila, H. zeae, H.goettingiana, H. riparia, H. humuli, H. latipons, H. sorghi, H. fici, H.litoralis, and H. turcomanica; lesion nematode species—Pratylenchusspecies such as P. scribneri, P. magnica, P. thornei, P. crenatus, P.brachyrus, P. vulnus, P. penetrans, P. coffeae, and P. neglectus; otherplant parasitic nematode species include: Hirschmanniella species,Radopholus species such as R. similis, and Pratylenchoid magnicauda.Animal intestinal parasitic nematode species for which polynucleotideshave been identified as a result of comparisons to the sequence datadisclosed herein include Ascaris lumbricoides, and Ascaris suum.

As used herein, a “pest resistance” trait is a characteristic of atransgenic plant, transgenic animal, transgenic host or transgenicsymbiont that causes the plant, animal, host, or symbiont to beresistant to attack from a pest that typically is capable of inflictingdamage or loss to the plant, animal, host or symbiont. Such pestresistance can arise from a natural genetic variation or more typicallyfrom incorporation of recombinant DNA that confers pest resistance. Fireet al. (U.S. Pat. No. 6,506,599) generically described inhibition ofpest infestation, and demonstrated gene suppression in the non-pestnematode species Caenorhabditis elegans. Similarly, Plaetinck et al. (US2003/0061626A1) suggests using dsRNA to inhibit gene function in avariety of nematode pests. Mesa et al. (US 2003/0150017 A1) describeusing DNA sequences to transform host cells to express dsRNA sequencesthat are substantially identical to target sequences in specificpathogens, and particularly describe constructing recombinant plantsexpressing such dsRNA sequences for ingestion by various plant pests,facilitating down-regulation of a gene in the genome of the pest, andimproving the resistance of the plant to the pest infestation. As usedherein, the term “expression” refers to the transcription and stableaccumulation of a nucleotide sequence comprising both sense andantisense RNA derived from the nucleic acid sequences disclosed in thepresent invention, whether or not the RNA sequence is capped, spliced,and polyadenylated and trafficked into the cytoplasm of the cell.Expression may also refer to translation of mRNA into a polypeptide orprotein. As used herein, the term “sense” RNA refers to an RNAtranscript corresponding to a sequence or segment that, when produced bythe target nematode, is in the form of a mRNA that is capable of beingtranslated into polypeptide by the target nematode cell. As used herein,the term “antisense RNA” refers to an RNA transcript that iscomplementary to all or a part of a mRNA that is normally produced in acell of a nematode. The complementarity of an antisense RNA may be withany part of the specific gene transcript, i.e., at the 5′ non-codingsequence, 3′ non-translated sequence, introns, or the coding sequence.As used herein, the term “RNA transcript” refers to the productresulting from RNA polymerase-catalyzed transcription of a DNA sequence.When the RNA transcript is a perfect complementary copy of the DNAsequence, it is referred to as the primary transcript or it may be anRNA sequence derived from post-transcriptional processing of the primarytranscript and is referred to as the mature RNA.

Exposure of a plant cyst forming nematode to the dsRNA, siRNA, or miRNAsequences of the present invention may occur during the nematodes'juvenile J2, J3, J4, adult female or adult male developmental stages.Exposure may occur as the J2 or male nematode is migrating through theplant vasculature, for example the cortical cells, or during or afterestablishment of a feeding site within syncytial cells. Exposure mayoccur by the production of the dsRNA in neighboring transfer-like cellswith movement into the feeding site. dsRNA, siRNA, or miRNA may enterthe nematode through a variety of means including, for example, throughthe stylet and pharnyx, the anus, the extratory duct, or amphidial andphasmid channels. dsRNA produced in the tissues of the feeding site mayenter the nematode by transport through the feeding tube (Hussey, R Sand Grundler et. al., 1998, Nematode parasitism of plants, Ch. 9, ThePhysiology and Biochemistry of Free-living and Plant-parasiticNematodes, eds R N Perry and D J Wright), directly from the cytoplasm,from extracellular regions, or from other plant compartments. Movementof dsRNA, siRNA, or miRNA into the nematode may require that the RNAexhibit a molecular weight of less than or substantially less than 25Kda (feeding tube size threshold). Creating an siRNA or miRNA in theplant that is bioavailable to the nematode may require preventing thesiRNA from entering or remaining within the plant RISC complex, aprotein complex well in excess of 25 KD. For example, this may beaccomplished through a number of means such as (1) by co-expressing asmall RNA-binding protein that exhibits a greater affinity for the plantRISC complex compared to the nematode specific siRNA, (2) by producingin the transgenic cell a nematode specific siRNA that is incompatiblewith the plant RISC complex yet functional in the nematode RISC complex,or (3) by down-regulating RISC complex expression in the feeding siteestablished by the nematode. Small RNA-binding proteins may be optimizedfor binding to a specific siRNA or miRNA by modifying amino acidresidues by phage display or other peptide selection methods.

As used herein, the phrase “inhibition of gene expression” or“inhibiting expression of a target gene in the cell of a nematode” mayrefer to the absence (or observable decrease) in the level of proteinand/or mRNA product from the target gene. In the event that a particulartranscript or translation product is not detectable, whether or not thelack of detection is a result of the expression of a dsRNA specificallydesigned to suppress the levels of such transcript or translationproduct, the phrase “inhibition of gene expression” or “inhibitingexpression of a target gene in the cell of a nematode” may refer to theobservation of a phenotypic effect or the lack thereof within the plantor within or about the target pest that feeds upon the transgenic plant.Specificity refers to the ability to inhibit the target gene withoutmanifest effects on other genes of the cell and without any effects onany gene within the cell that is producing the dsRNA molecule. Theinhibition of gene expression of one or more target genes in thenematode may result in novel phenotypic traits in the nematode.

Without limiting the scope of the present invention, there is provided,in one aspect, a method for controlling plant infestation by a nematodeor other plant pest using stabilized dsRNA strategies. The methodinvolves generating stabilized dsRNA molecules as one type of nematodecontrol agents, that when provided in the diet of the nematode, inducegene silencing. As used herein, the phrase “generating a stabilizeddsRNA molecule” refers to the methods of employing recombinant DNAtechnologies to construct a DNA nucleotide sequence that transcribes astabilized dsRNA. As used herein, the term “silencing” refers theeffective “down-regulation” of expression of one or more targetednucleotide sequences within one or more cells of a nematode or otherplant pest and, hence, the elimination of the ability of the targetednucleotide sequence(s) to cause its normal effect within the cell.

The present invention also provides in part a delivery system forproviding a nematode control agent to a nematode through exposure of thenematode to a host, such as a plant containing the one or more controlagents of the present invention by ingestion of the plants' cells or thecontents of those cells. One embodiment of the present inventionprovides for generating a transgenic plant cell or a plant that containsa recombinant DNA construct transcribing the stabilized dsRNA moleculesof the present invention. As used herein, the phrase “generating atransgenic plant cell or a plant” refers to the methods of employingrecombinant DNA technologies to construct a plant transformation vectortranscribing the stabilized dsRNA molecules of the present invention, totransform a plant cell or a plant with such vector, and to generate thetransformed plant cell or transgenic plant containing a part of thevector that transcribes the stabilized dsRNA molecules. In particular,the method of the present invention may comprise a recombinant DNAconstruct in a cell of a plant that results in dsRNA transcripts thatare substantially homologous to an RNA sequence expressed by anucleotide sequence contained within the genome of a nematode. Where thenucleotide sequence within the genome of a nematode comprises a geneessential to the viability and infectivity of the nematode, itsdown-regulation results in a reduced capability of the nematode tosurvive and/or infect and/or cause damage to host cells. Hence, suchdown-regulation results in a “deleterious effect” on the maintenance,viability, and infectivity of the nematode, in that it prevents orreduces the nematode's ability to feed off of and survive on nutrientsderived from the host cells. By virtue of this reduction in thenematode's viability and infectivity, resistance and/or enhancedtolerance to infection by a nematode or other plant pest is facilitatedin the cells of a plant.

It is envisioned that the compositions of the present invention can beincorporated within the seeds of a plant species either as a product ofexpression from a recombinant gene incorporated into the genome of theplant cells, or incorporated into a coating or seed treatment that isapplied to the seed before planting. A plant derived from a single plantcell transformed to contain a recombinant or heterologous gene isconsidered herein to be a transgenic event.

The present invention also includes seeds and plants having more thatone agronomically important trait. Such combinations are referred to as“stacked” traits. These stacked traits can include a combination oftraits that are directed at the same target nematode pest, or they canbe directed at different target nematode pests, or to one or more insectpests, or can provide herbicide tolerance to the plant, for exampletolerance to glyphosate herbicide. The stacked traits can be achieved bybreeding to plants that have the trait or by building a chimeric DNAconstruct that contains multiple plant expression cassettes andtransforming the expression cassettes into the genome of the plant.

Cells of a plant seed of the present invention may express one or moredsRNA's, the sequence of any one of which is derived from a targetsequence, i.e., a nematode specific sequence disclosed herein in SEQ IDNO:1-SEQ ID NO:45569, and also may express a nucleotide sequence thatprovides herbicide tolerance, for example, resistance to glyphosate,N-(phosphonomethyl) glycine, including the isopropylamine salt form ofsuch herbicide. Herbicides for which transgenic plant tolerance has beendemonstrated include but are not limited to: glyphosate, glufosinate,sulfonylureas, imidazolinones, bromoxynil, delapon, cyclohezanedione,protoporphyrionogen oxidase inhibitors, and isoxasflutole herbicides.Polynucleotide molecules encoding proteins involved in herbicidetolerance are known in the art, and include, but are not limited to apolynucleotide molecule encoding 5-enolpyruvylshikimate-3-phosphatesynthase, bromoxynil nitrilase, phytoene desaturase, norflurazon,acetohydroxyacid synthase and the bar gene for tolerance to glufosinateand bialaphos (U.S. Pat. Nos. 5,627,061, 5,633,435, 6,040,497,5,094,945, and 4,810,648).

As used herein, the term “pest control agent”, or “gene suppressionagent” refers to one or more particular RNA molecules consisting of afirst RNA segment and a second RNA segment that are complimentary toeach other and are linked by a third RNA segment. The complementaritybetween the first and the second RNA segments results in the ability ofthe two segments to hybridize in vivo and in vitro to form a doublestranded molecule, i.e., a stem comprising the first and the secondsegments linked together by the third segment which forms a loop betweenthe first and second segments, so that the entire structure forms into astem and loop structure. Structures consisting of a first and a secondsegment that hybridize more tightly to each other may form into astem-loop knotted structure. The first and the second segments, whenhybridized together, correspond invariably, and not respectively, to asense and an antisense sequence with respect to the target RNAtranscribed from the target gene in the target nematode that issuppressed by the ingestion of the dsRNA molecule, or ingestion of ansiRNA molecule derived from the dsRNA molecule. The pest control agentcan also be a substantially purified (or isolated) nucleic acid moleculeand more specifically nucleic acid molecules or nucleic acid fragmentmolecules thereof from a genomic DNA (gDNA) or cDNA library. Suchsubstantially purified molecules can be applied to a seed, whether aseed from a transgenic plant or otherwise, in the form of a seedtreatment, together with a pharmaceutically acceptable carrier forstabilizing the dsRNA molecules, resulting in the dsRNA beingbioavailable within a plant grown from the seed, or bioavailabilitywithin the rhizosphere of the root system of the plant grown from theseed. A seed may be treated with one or more agents, each exhibitingdifferent activities designed to provide the seed, the germinatingseedling, and the growing plant or root with one or more advantages incomparison to other plants, such as pest resistance, includingbacterial, fungal, and nematode resistance, fertilizers, growthstimulants, gene stimulants or suppressors, herbicide functions to whichthe seed, germ, and or roots and seedling are resistant, and the like.Alternatively, the fragments may comprise smaller dsRNA oligonucleotidescomprising from about 15 to about 750 or more consecutive nucleotidesselected from the group consisting of SEQ ID NO:1-SEQ ID NO:45569 andthe complements thereof, or from about 15 to about 30 nucleotides, orfrom about 21 to about 24 consecutive nucleotides. The pest controlagent may also refer to a DNA construct that comprises the polynucleicacid molecules or nucleic acid fragment molecules of the presentinvention and the DNA construct is a transgene incorporated into thegenome of a host cell. The pest control agent may further refer to aplant comprising such a DNA construct in its genome or in the genome ofa subcellular organelle that comprises the polynucleic acid molecules ornucleic acid fragment molecules described in the present invention. Themethod of the present invention provides for the production of a dsRNAtranscript, the nucleotide sequence of which is substantially homologousto a targeted RNA sequence encoded by a target nucleotide sequencewithin the genome of a target pest.

As used herein, the term “genome” as it applies to cells of a nematode,a plant pest, or a host encompasses not only chromosomal DNA foundwithin the nucleus, but organelle DNA found within subcellularcomponents of the cell. The sequences of the present invention, whenintroduced into plant cells, can therefore be either chromosomallyintegrated or organelle-localized. The term “genome” as it applies tobacteria encompasses both the chromosome and plasmids within a bacterialhost cell. The DNA's of the present invention introduced into bacterialhost cells can therefore be either chromosomally integrated, localizedto a plasmid, or to a viral vector capable of replication in thebacterial host.

In certain preferred embodiments expression of the gene targeted forsuppression in the plant pest is inhibited by at least about 10%, atleast about 33%, at least about 50%, at least about 80%, at least about90%, at least about 95%, or by at least about 99% or more within cellsof the nematode so a significant inhibition takes place. Significantinhibition is intended to refer to inhibition sufficient to result in adetectable phenotype (e.g., cessation of growth, paralysis, sterility,behavioral effects, second generation effects, effects observed onnematodes ingesting dsRNA or on their progeny, morbidity, or mortality,etc.) or a detectable decrease in RNA and/or protein corresponding tothe target gene being inhibited. Although in certain embodiments of theinvention inhibition occurs in substantially all cells of the nematode,in other preferred embodiments inhibition occurs in only a subset ofcells that are contacted with the dsRNA, or that are expressing thetarget gene transcript.

The advantages of the present invention may include, but are not limitedto the ease of introducing dsRNA into the nematode or other pest cells,the low concentration of dsRNA, siRNA, or miRNA which can be used, thestability of dsRNA, siRNA, or miRNA and the effectiveness of theinhibition. The present invention provides a method for selectingpolynucleotide sequences of a target gene sequence and is not limited toin vitro use of specific sequence compositions identified by the methodor to the set of exemplary target genes of the present invention.Segments of the nucleotide sequences of the present invention may beselected for their level of gene inhibition/suppression by scanningsegments of the H. glycines sequences to identify segments that exhibitpreferred levels of gene suppression or pest inhibition when provided asa dsRNA molecule in the diet of one or more target pests such as H.glycines.

As used herein, the term “sequence identity”, “sequence similarity” or“homology” is used to describe sequence relationships between two ormore nucleotide or amino acid sequences. The percentage of “sequenceidentity” between two sequences is determined by comparing two optimallyaligned sequences over a comparison window, wherein the portion of thesequence in the comparison window may comprise additions or deletions(i.e., gaps) as compared to the reference sequence (which does notcomprise additions or deletions) for optimal alignment of the twosequences. The percentage identity of a reference sequence to another iscalculated by determining the number of positions at which the referencesequence (whether nucleic acid or amino acid sequence) is identical toanother sequence to yield the number of matched positions, dividing thenumber of matched positions by the total number of positions in thewindow of comparison, and multiplying the result by 100 to yield thepercentage of sequence identity. A sequence that is identical at everyposition in comparison to a reference sequence is said to be, withrespect to a nucleotide sequence or amino acid sequence, identical tothe reference sequence and vice-versa. A first nucleotide sequence whenobserved in the 5′ to 3′ direction is said to be the “complement” of, orcomplementary to, a second or reference nucleotide sequence observed inthe 3′ to 5′ direction if the reverse complement of the first nucleotidesequence is identical at every nucleotide position with the second orreference sequence. As used herein, two nucleic acid sequence moleculesare said to exhibit “complete complementarity” when every nucleotide ofone of the sequences, when read 5′ to 3′, is complementary to everynucleotide of the other sequence when read 3′ to 5′. A nucleotidesequence that is complementary to a reference nucleotide sequence willexhibit a sequence identical to the reverse complement sequence of thereference nucleotide sequence.

In practicing the present invention, a target gene may be derived from anematode or other pest species that causes damage to one or moredifferent crop plants and/or yield losses to such plants. Severalcriteria may be employed in the selection of target genes. The gene maybe one whose protein product has a rapid turnover rate, so that dsRNAinhibition will result in a rapid decrease in protein levels. In certainembodiments it is advantageous to select a gene for which a smalldecrease in expression level results in deleterious effects for thepest. It may be desirable to target a broad range of nematode speciesand so a nucleotide sequence is selected that is highly conserved acrossthe targeted range of species. Conversely, for the purpose of conferringspecificity, in certain embodiments a nucleotide sequence is selectedthat contains regions that are poorly conserved between individualtargeted pest species, or between the targeted pest and other organisms.In certain embodiments it may be desirable to select a nucleotidesequence that exhibits no known homology to sequences in otherorganisms. As used herein, the term “derived from” refers to a specifiednucleotide sequence that may be obtained from a particular source orspecies.

Target genes for use in the present invention may include, for example,those that play important roles in the viability, growth, development,reproduction and infectivity of a particular pest. These target genesmay be one or more of any house keeping gene, transcription factor andpest specific gene that provides an observable phenotype, in particulara phenotype that results in the suppression of feeding on or theinability to utilize a transgenic soybean plant expressing a SCN deriveddsRNA as a nutrient source. For example, target genes that areanticipated herein to be effective in producing such phenotypes aresimilar to those that have been shown to affect the viability, growth,development, mobility, neurological stimulation, muscular function, andreproduction in C. elegans, including but not limited to the followingphenotypes: (Adl) adult lethal, (Age), (Bli) blistered, (Bmd) bodymorphology defect, (Ced) Cell death abnormality, (Clr) clear, (Daf)DAuer Formation, (Dpy) dumpy, (Egl) egg laying defect, (Emb) embryoniclethal, (Evl) everted vulva, (Fem) feminization of XX and XO animals,(Fgc) Fewer Germ Cells, (Fog) feminization of germline, (Gon) GONaddevelopment abnormal, (Gro) slow growth, (Him) high incidence of maleprogeny, (Hya) HYperActive, (Let) larval lethal, (Lin) lineage abnormal,(Lon) long body, (Lpd), (Lva) larval arrest, (Lvl) larval lethal, (Mab)Male ABnormal, (Mei) Defective meiosis, (Mig) MIGration of cellsabnormal, (Mlt) molt defect, (Morphology), (Mut) Mutator, (Muv)MUltiVulva, (Oma) Oocyte MAturation defective, (Pat) Paralyzed, Arrestedelongation at Two-fold, (Pch) PatCHy coloration, (Pnm) Pronuclearmigration alteration in early embryo, (Prl) paralyzed, (Prz) PaRaLyzed,(Pvl) protruding vulva, (Pvu) protruding vulva, (Rde), (Reproductive),(Rol) roller, (Rot) centrosome pair and associated pronuclear rotationabnormal, (Rup) exploded, (Sck) sick, (Sle) Slow embryonic development,(Slu) SLUggish, (Sma) small, (Spd) SpinDle, abnormal embryonic, (Spo)Abnormal embryonic spindle position and orientation, (Step) sterile,(Stp) sterile progeny, (Unc) uncoordinated, (Unclassified), (Vul)vulvaless, (WT), (defect) morphological or behavioral defects. SCNgenome sequences predicted to encode various gene products set forthherein annotated to the C. elegans specific genes previously shown toexert a negative effect or observable phenotype in Drosophila or in C.elegans are anticipated to be effective targets for achieving a similarphenotype when expressed in planta as a dsRNA for the purpose ofsuppressing a gene in SCN specifically targeted by the dsRNA. Genesequences unique to SCN and not annotated to sequences or gene productsfrom other organisms are also anticipated to be effective for achievingcontrol of SCN when such sequences are provided in the diet of the SCNas a dsRNA because the target genes are unique to SCN pest metabolism,physiology, and pathogenicity.

DNA segments of the present invention are desired for use inconstructing dsRNA expression sequences, particularly if the DNAsegments exhibit at least from about 70% identity, or at least fromabout 75% identity, or at least from about 80% identity, or at leastfrom about 90% identity, or at least from about 95% identity, or atleast from about 98% identity, or at least about 100% identity tocontiguous 17-24 nucleotide sequences found within the nematode genomeor other pest sequences targeted for suppression. Sequences less thanabout 80% identical to a target gene are anticipated to be lesseffective and so less desirable. Inhibition is specific to thenematodes' gene or gene families, the sequence of which correspondssubstantially to the dsRNA. Expression of unrelated genes is notaffected. This specificity allows the selective targeting of a nematodeor other pest species, resulting in the absence of an effect onnon-target organisms exposed to the compositions of the presentinvention.

The regions predicted to be more effective at dsRNA-mediated genesilencing include regions that exhibit higher siRNA efficiency. HighersiRNA efficiency may be achieved by any technique, including, but notlimited to, computational methods such as algorithms designed to predictsiRNA efficiency based on thermodynamic characteristics of a given dsRNA(or DNA) sequence, generally considering sequences of from about 17, toabout 18, to about 19, to about 20, to about 21, to about 22, to about23, or even to about 24 contiguous nucleotides corresponding to asequence that is being targeted for suppression (Schwarz et al., 200,Cell 115:199-208; Chalk et al. 2004, Biochem. Biophys. Res. Comm.319:264-274; Ui-Tei et al., NAR 2004, 32:936-948; Reynolds et al., 2004,Nature Biotechnology 22:326-330).

Inhibition of a target gene using the stabilized dsRNA technology of thepresent invention is sequence-specific in that nucleotide sequencescorresponding to the duplex region of the RNA are targeted for geneticinhibition. RNA containing a nucleotide sequences identical to a portionof the target gene is preferred for inhibition. RNA sequences withinsertions, deletions, and single point mutations relative to the targetsequence are also effective for gene specific inhibition. In performanceof the present invention, it is preferred that the inhibitory dsRNA andthe portion of the target gene share at least from about 75% sequenceidentity, or from about 80% sequence identity, or from about 90%sequence identity, or from about 95% sequence identity, or from about99% sequence identity, or even about 100% sequence identity.Alternatively, the duplex region of the RNA may be defined functionallyas a nucleotide sequence that hybridizes with a portion of the targetgene transcript. A greater sequence homology across a target genesequence that is less than full-length in comparison to the target genecompensates for a less homologous sequence that more closelyapproximates the full length of the target gene. The length of anucleotide sequence that is identical to a portion of the target genesequence can be from about 21, to about 25, to about 50, to about 100,to about 200, to about 300, or more contiguous bases. Normally, asequence of greater than 20-100 nucleotides is preferable, although asequence of greater than about 200-300 nucleotides may be preferred,depending on the length of the target gene. The invention has theadvantage of being able to tolerate sequence variations due to geneticmutation, strain polymorphism, or evolutionary divergence. Therefore thenucleic acid molecule introduced into a plant for expression as a pestspecific dsRNA gene suppression construct may not need to exhibitabsolute homology, and may not need to represent the full length of thesequence targeted for suppression.

The dsRNA molecules may be synthesized either in vivo or in vitro. ThedsRNA may be formed by a single self-complementary RNA strand or twocomplementary RNA strands expressed from separate expression constructs.Endogenous RNA polymerase of the cell may mediate transcription in vivo,or cloned RNA polymerase can be used for transcription in vivo or invitro. Inhibition may be achieved by specific transcription in an organ,tissue, or cell type; stimulation of an environmental condition (e.g.,infection, stress, temperature, chemical inducers); and/or engineeringtranscription at a developmental stage or age of the transgenic plantexpressing the dsRNA construct. The RNA sequences expressed from therecombinant construct may or may not be polyadenylated. The RNAsequences expressed from the recombinant construct may or may not becapable of being translated into a polypeptide by a cell's translationalapparatus.

The RNA, dsRNA, siRNA, or miRNA of the present invention intended foruse in controlling plant pest infestation may be produced chemically orenzymatically through manual or automated reactions or in vivo in anorganism other than the plant for which pest control is intended. RNAmay also be produced by partial or total organic synthesis. Any modifiedribonucleotide can be introduced by in vitro enzymatic or organicsynthesis. The RNA may be synthesized by a cellular RNA polymerase or abacteriophage RNA polymerase (e.g., T3, T7, SP6). If synthesizedchemically or by in vitro enzymatic synthesis, the RNA may be purifiedprior to introduction into the cell or formulated in an agronomicallyacceptable carrier and applied to the soil, to the roots, or to the seedprior to planting. For example, RNA can be purified from a mixture byextraction with a solvent or resin, precipitation, electrophoresis,chromatography, or a combination thereof. Alternatively, the RNA may beused with no, or a minimum of, purification to avoid losses due tosample processing. The RNA may be dried for storage or dissolved in anaqueous solution. The solution may contain buffers or salts to promoteannealing, and/or stabilization of the duplex strands.

For transcription from a transgene in vivo or from an expressioncassette, a regulatory region (e.g., promoter, enhancer, silencer,leader, intron and polyadenylation) may be used to modulate thetranscription of the RNA strand (or strands). Therefore, in oneembodiment, the polynucleotide sequences constructed to facilitatetranscription of the RNA molecules of the present invention are operablylinked to one or more promoter sequences functional in a plant host. Thepolynucleotide sequences may be placed under the control of anendogenous promoter normally present in the host genome. Thepolynucleotide sequences of the present invention, under the control ofan operably linked promoter sequence, may further be flanked byadditional sequences that advantageously affect its transcription and/orthe stability of a resulting transcript. Such sequences are generallylocated upstream of the promoter and/or downstream of the 3′ end of theexpression construct. The term “operably linked”, as used in referenceto a regulatory sequence and a structural nucleotide sequence, meansthat the regulatory sequence causes regulated expression of the linkedstructural nucleotide sequence. “Regulatory sequences” or “controlelements” refer to nucleotide sequences located upstream, within, ordownstream of a structural nucleotide sequence, and which influence thetiming and level or amount of transcription, RNA processing orstability, or translation of the associated structural nucleotidesequence. Regulatory sequences may include promoters, translation leadersequences, introns, enhancers, stem-loop structures, repressor bindingsequences, termination sequences, pausing sequences, polyadenylationrecognition sequences, and the like.

In another embodiment, the nucleotide sequence of the present inventioncomprises an inverted repeat sequence separated by a spacer sequence.The spacer sequence may be a region comprising any sequence ofnucleotides that facilitates secondary structure formation between theinverted repeat sequences. In one embodiment, the spacer sequence ispart of the sense or antisense polynucleotide sequence for mRNA. Thespacer sequence may alternatively comprise any combination ofnucleotides or homologues thereof that are capable of being linkedcovalently to a nucleic acid molecule. The spacer sequence may comprisea contiguous sequence of nucleotides of from about 8-100 nucleotides inlength, or alternatively from about 100-200 nucleotides in length, orfrom about 200-400 nucleotides in length, or from about 400-500nucleotides in length, or from about 500 to about 1500 nucleotides inlength.

The gene or genes targeted for suppression may be amplified using anythermal amplification means and the precise nucleotide sequencedetermined. One skilled in the art is able to modify the thermalamplification conditions in order to ensure optimal amplicon productformation, and the amplicon may be used as a template for in vitrotranscription to generate sense and antisense RNA with the includedminimal promoters.

As used herein, the phrase “a substantially purified nucleic acid”, “anartificial sequence”, “an isolated and substantially purified nucleicacid”, or “an isolated and substantially purified nucleotide sequence”,with respect to a naturally occurring nucleotide sequence, refers to anucleic acid molecule that is substantially removed from the compositionwith which it is associated in its natural state. Examples of asubstantially purified nucleic acid molecule include: (1) a DNA sequencecomprising the contiguous sequence at least about 17, or about 18, orabout 19 or more nucleotides in length consisting of a portion of anaturally occurring DNA molecule, but which is not flanked bypolynucleotide sequences occur naturally on either end of the contiguoussequence; (2) a nucleic acid molecule comprising a naturally occurringcontiguous nucleotide sequence isolated from its naturally occurringstate and incorporated into a DNA construct; (3) a cDNA, a genomic DNAfragment isolated and purified substantially from all other genomic DNAto which it was originally naturally associated, an amplicon fragmentproduced using thermal amplification procedures, or a restrictionfragment; (4) recombinant DNA; and (5) synthetic DNA. A substantiallypurified nucleic acid may also be comprised of one or more segments ofany of the sequences referred to hereinabove.

Nucleic acid molecules, fragments thereof, and complements thereofselected from the group consisting of SEQ ID NO:1-45568 may be employedas probes or primers to identify related nucleic acid molecules fromother species for use in the present invention to produce desired dsRNA,siRNA, and miRNA molecules. Such related nucleic acid molecules includethe nucleic acid molecules that encode the complete amino acid sequenceof a protein, and the promoters and flanking sequences of suchmolecules. In addition, such related nucleic acid molecules includenucleic acid molecules that encode gene family members. Such moleculescan be readily obtained by using the above-described nucleic acidmolecules or fragments thereof to screen complementary DNA or genomicDNA libraries obtained from a nematode or other plant pest species. Thescreen can be any physical means such as northern, southern, or anyimmunologically based screening method that detects either the specificsequence of a nucleotide molecule, or the transcribed and/or translatedproduct of such nucleotide molecule, or any mathematical algorithm thatis used for comparing nucleotide sequences in silico.

Nucleic acid molecules, fragments thereof, and complements thereofselected from the group consisting of SEQ ID NO:45569-SEQ ID NO:97729may also be used in a similar fashion to screen other genomes,libraries, and organisms for related sequences. Such related sequencesare expected to include but not be limited to homologues that includenucleic acid molecules that encode, in whole or in part, proteinhomologues of other pest species, plants or other organisms. Suchmolecules can be readily obtained by using the above-described nucleicacid molecules or fragments thereof to screen EST, cDNA or gDNAlibraries. Such homologous molecules may differ in their nucleotidesequences from those found in one or more of SEQ ID NO:1-SEQ ID NO:45568and SEQ ID NO:45569 through SEQ ID NO:97729 or complements thereof,because perfect complementarity is not required for such relatedsequences to hybridize to each other. In a particular embodiment,methods for 3′ or 5′ RACE may be used to obtain such sequences (Frohman,M. A. et al., Proc. Natl. Acad. Sci. (U.S.A.) 85:8998-9002, 1988; Ohara,O. et al., Proc. Natl. Acad. Sci. (U.S.A.) 86:5673-5677, 1989). Ingeneral, any of the above described nucleic acid molecules or fragmentsmay be used to generate dsRNA's, siRNA's, and/or siRNA's that aresuitable for use in a diet, in a spray-on mix, or in a recombinant DNAconstruct of the present invention.

As used herein, the phrase “coding sequence”, “structural nucleotidesequence” or “structural nucleic acid molecule” refers to apolynucleotide molecule that is translated into a polypeptide whenplaced under the control of appropriate regulatory sequences. Thestructural nucleotide sequence, coding sequence, or structural nucleicacid molecule can be referred to using other terms in the art, but isintended to include DNA as well as RNA molecules. A coding sequence caninclude, but is not limited to, genomic DNA sequences or portionsthereof identified to encode or to be capable of encoding a polypeptide,a cDNA produced as a result of reverse transcription of mRNA that hasbeen purified substantially because if its ability to hybridize to apolyT sequence, expressed sequence tagged (EST) sequences, andrecombinant nucleotide sequences produced specifically for expression ofa protein sequence.

Two molecules are said to be “minimally complementary” if they canhybridize to one another with sufficient stability to permit them toremain annealed to one another under at least conventional“low-stringency” conditions. Similarly, the molecules are said to becomplementary if they can hybridize to one another with sufficientstability to permit them to remain annealed to one another underconventional “high-stringency” conditions. Conventional stringencyconditions are described by Sambrook, et al., (1985). Appropriatestringency conditions which promotes hybridization of two differentnucleic acid sequences are, for example, incubation of the two sequencestogether in 6.0× sodium chloride/sodium citrate (SSC) at about 45° C.where one of the two different sequences is tethered in some fashion toa solid support and the untethered sequence is linked to a reportermolecule such as a ligand that can be detected using an immunologicalmeans, a fluorophores, a radioisotope, or an enzyme. The hybridizationof the two sequences under the above conditions can be followed by awash in 2.0×SSC at 50° C. to remove any excess reagents or unbound orunhybridized probe or untethered molecules (Current Protocols inMolecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6). Forexample, the salt concentration in the wash step can be selected from alow stringency of about 2.0×SSC at 50° C. to a high stringency of about0.2×SSC at 50° C. In addition, the temperature in the wash step can beincreased from low stringency conditions at room temperature (about 22°C.) to high stringency conditions (about 65° C.). Temperature and saltmay be varied together or independent of each other.

A nucleic acid for use in the present invention may specificallyhybridize to one or more of nucleic acid molecules from nematodes orcomplements thereof under moderately stringent conditions, for exampleat about 2.0×SSC and about 65° C. A nucleic acid for use in the presentinvention will include those nucleic acid molecules that specificallyhybridize to one or more of the nucleic acid molecules disclosed thereinas set forth in SEQ ID NO:1 through SEQ ID NO:47643 or complementsthereof under high stringency conditions. Preferably, a nucleic acid foruse in the present invention will exhibit at least from about 70%, atleast from about 80%, at least from about 90%, at least from about 95%,at least from about 98% or even about 100% sequence identity with one ormore nucleic acid molecules as set forth in SEQ ID NO:45569 through SEQID NO:47643.

Nucleic acids of the present invention may be entirely syntheticallyconstructed or assembled piecemeal from naturally occurring orcombinations of naturally occurring and synthetic components. All or anyportion of the nucleic acids of the present invention may be synthesizedwithout reference to codon usage calculated for any particular plantspecies, however when a particular sequence is intended to be effectivein suppression of one or more genes in one or more pest species, it ispreferable that the sequence be selected such that the sequence in anygene or species targeted for suppression be entirely or substantiallyentirely identical or entirely or substantially entirely complementaryto the suppressor sequence.

The present invention also relates to recombinant DNA constructs forexpression in a microorganism. Heterologous nucleic acids from which anRNA of interest is transcribed can be introduced into a microbial hostcell, such as a bacterial cell or a fungal cell, in order to producequantities of double stranded RNA for use in suppression of one or moregenes in one or more plant pests.

The present invention also contemplates transformation of apolynucleotide sequence of the present invention into a plant to achievenematode or other plant pest inhibitory levels of expression of one ormore dsRNA molecules. A plant transformation vector comprises one ormore nucleotide sequences that is/are capable of being transcribed as anRNA molecule and that is/are substantially homologous and/orcomplementary to one or more nucleotide sequences encoded by the genomeof the nematode, or other plant pest, such that upon uptake of the RNAmolecule, results in a down-regulation of expression of at least one ofthe respective nucleotide sequences of the nematode or other plant pest.In one embodiment the plant transformation vector is an isolated andpurified DNA molecule comprising a promoter operatively linked to acontiguous nucleotide sequence comprising one or more polynucleotidemolecules of the present invention selected from the group consisting ofSEQ ID NO:45569 through SEQ ID NO:50775. The polynucleotide moleculeincludes a segment comprising all or part of a RNA moleculecomplementary to a targeted RNA within a nematode or pest cell, and mayalso contain a functional intron sequence positioned either upstream ofor within the transcribed RNA sequence, and may also contain a fiveprime (5′) untranslated leader sequence (i.e., a UTR or 5′-UTR)positioned between the promoter and the point of transcriptioninitiation.

A plant transformation vector may contain sequences for suppression ofmore than one gene, thus allowing production of more than one dsRNA forinhibiting expression of two or more genes. One skilled in the art willreadily appreciate that segments of DNA whose sequence corresponds tothat present in different genes can be combined into a single compositeDNA segment for expression in a transgenic plant to achieve suppressionof one or more nematode or pest genes, one or more plant genes, or acombination thereof. Alternatively, a plasmid of the present inventionalready containing at least one DNA segment can be modified by thesequential insertion of additional DNA segments between an enhancerand/or promoter and the terminator sequences. A nematode or other plantpest control agent of the present invention may be designed for theinhibition of multiple genes, and the genes to be inhibited can beobtained from the same nematode or other plant pest species in order toenhance the effectiveness of the pest control agent, or from differentraces/variants of the same pest species, or from different pest speciesor other organisms. In certain embodiments, the genes derived fromdifferent nematodes or other plant pests provide for a broadening of therange of nematodes and other plant pests against which the pest controlagent is effective. When multiple genes in one pest are targeted forsuppression, a polycistronic DNA element can be fabricated (Fillatti, USPatent Application Publication No. US 2004-0029283 A1).

A promoter that drives expression of a polynucleotide sequence in aparticular species of plant is selected for use in expression constructsin which a nucleotide sequence of the present invention is to be used totransform a plant. Promoters that function in different plant speciesare known in the art. Promoters useful for expression of polypeptides inplants are those that are inducible, viral, synthetic, or constitutiveas described in Odell et al. (1985 Nature 313:810-812), and/or promotersthat are temporally regulated, spatially regulated, andspatio-temporally regulated. For the purpose of the present invention,e.g., for optimum control of species that feed on roots, it ispreferable to achieve the highest levels of expression of these geneswithin the roots of plants. A number of promoters exhibitingroot-enhanced levels of expression of operably linked sequences havebeen identified. (Lu et al., 2000 J. Plant Phys., 156(2):277-283; U.S.Pat. Nos. 5,837,848 and 6,489,542). Expression of the constructs of thepresent invention may preferably be from polymerase III promoters as analternative to conventional polymerase II promoters, and also may belinked to inducible promoters, or heterologous promoters that requireheterologous accessory proteins, such as for example, phage T7 promotersand the like. Promoters that are induced as a result of theestablishment by a cyst nematode of a feeding site (feeding sitespecific promoters), and promoters up-regulated by nematode invasion arespecifically contemplated for use in the present invention (Gheysen etal., 2002, Ann. Rev. Phytopathol. 40:191-219).

A recombinant DNA vector or construct of the present invention willtypically comprise a marker that confers a selectable phenotype ontransformed plant cells, and may also be used to select for plants orplant cells that contain the exogenous nucleic acids of the presentinvention. The marker may encode biocide resistance, antibioticresistance (e.g., kanamycin, G418 bleomycin, hygromycin, etc.), orherbicide resistance (e.g., glyphosate, etc.). Examples of selectablemarkers include, but are not limited to, a neo gene (Potrykus et al.,1985 Mol. Gen. Genet. 199:183-188) which codes for kanamycin resistanceand can be selected for using kanamycin, G418, etc.; a bar gene whichcodes for bialaphos resistance; a mutant EPSP synthase gene (Hinchee etal., 1988 Bio/Technology 6:915-922) which encodes glyphosate resistance;a nitrilase gene which confers resistance to bromoxynil (Stalker et al.,1988 J. Biol. Chem. 263:6310-6314); a mutant acetolactate synthase gene(ALS) which confers imidazolinone or sulphonylurea resistance (EuropeanPatent Application 154,204); an AMPA-acetyltransferase gene forresistance to phosphonates (U.S. Pat. No. 6,448,476), a methotrexateresistant DHFR gene (Thillet et al., 1988 J. Biol. Chem.263:12500-12508), and compositions for chloroplast or plastidtransformation selection (U.S. Pat. Nos. 5,693,507, 5,451,513, and WO95/24492).

A recombinant vector or construct of the present invention may alsoinclude a screenable marker for monitoring expression. Exemplaryscreenable markers include a β-glucuronidase or uidA gene (GUS) whichencodes an enzyme for which various chromogenic substrates are known(Jefferson, Plant Mol. Biol, Rep. 5.387-405, 1987; Jefferson et al.,EMBO J. 6:3901-3907, 1987); an R-locus gene, which encodes a productthat regulates the production of anthocyanin pigments (red color) inplant tissues (Dellaporta et al., Stadler Symposium 11:263-282, 1988); aβ-lactamase gene (Sutcliffe et al., Proc. Natl. Acad. Sci. (U.S.A.)75:3737-3741, 1978), a gene which encodes an enzyme for which variouschromogenic substrates are known (e.g., PADAC, a chromogeniccephalosporin); a luciferase gene (Ow et al., Science 234:856-859, 1986)a xylE gene (Zukowsky et al., Proc. Natl. Acad. Sci. (U.S.A.)80:1101-1105, 1983) which encodes a catechol dioxygenase that canconvert chromogenic catechols; an α-amylase gene (Ikatu et al.,Bio/Technol. 8:241-242, 1990); a tyrosinase gene (Katz et al., J. Gen.Microbiol. 129:2703-2714, 1983) which encodes an enzyme capable ofoxidizing tyrosine to DOPA and dopaquinone which in turn condenses tomelanin; an α-galactosidase, which catalyzes a chromogenic α-galactosesubstrate; and a β-galactosidase which catalyzes the conversion of achromogenic β-galactoside substrate.

In general a functional recombinant DNA is introduced at a non-specificlocation in a plant genome. In special cases it may be useful to inserta recombinant DNA construct by site-specific integration. Severalsite-specific recombination systems exist which are known to functionimplants include cre-lox as disclosed in U.S. Pat. No. 4,959,317 andFLP-FRT as disclosed in U.S. Pat. No. 5,527,695.

Preferred plant transformation vectors include those derived from a Tiplasmid of Agrobacterium tumefaciens (e.g. U.S. Pat. Nos. 4,536,475,4,693,977, 4,886,937, 5,501,967 and European Patent Application No.0122791). Agrobacterium rhizogenes plasmids (or “Ri”) are also useful.Other preferred plant transformation vectors include those disclosed,e.g., by Herrera-Estrella (1983 Nature 303:209-213), Bevan (1983 Nature304:184-187), Klee (1985 Bio/Technol. 3:637-642) and Eur. Pat Appl. No.EP0 120 516.

Methods and compositions for transforming plants by introducing arecombinant DNA construct into a plant genome includes any of a numberof methods known in the art. One method for constructing transformedplants is microprojectile bombardment as illustrated in U.S. Pat. Nos.5,015,580 (soy), 5,550,318 (corn), 5,538,880 (corn), 6,153,812 (wheat),6,160,208 (corn), 6,288,312 (rice) and 6,399,861 (corn). Another methodfor constructing transformed plants is Agrobacterium-mediatedtransformation in cotton (U.S. Pat. No. 5,159,135), corn (U.S. Pat. No.5,591,616), and soy (U.S. Pat. Nos. 5,824,877 & 6,384,301).

The term “transgenic plant cell” or “transgenic plant” refers to a plantcell or a plant that contains an exogenous or heterologouspolynucleotide sequence. A transgenic plant also comprises progeny(seeds, and plants and seeds produced from such seeds, etc.) of anygeneration of such a transgenic plant or a seed of any generation of allsuch transgenic plants wherein said progeny or seed comprises theexogenous or heterologous polynucleotide sequence. The heterologous orpolynucleotide sequence is a DNA molecule that is transcribed into theRNA, sRNA, dsRNA, siRNA, or miRNA or fragment thereof of the presentinvention.

A transgenic plant formed using Agrobacterium mediated transformationmethods contains at least a single recombinant DNA sequence insertedinto the plant chromosome and is referred to as a transgenic event. Suchtransgenic plants are referred to as being heterozygous for the insertedexogenous sequence. A transgenic plant homozygous with respect to atransgene can be obtained by sexually mating (selfing) an independentsegregant transgenic plant that contains a single exogenous genesequence to itself, for example an F0 plant, to produce F1 seed. Onefourth of the F1 seed produced will be homozygous with respect to thetransgene. F1 seed can be tested using a SNP or related thermalamplification assay that allows distinction between heterozygotes andhomozygotes (i.e., a zygosity assay).

Transgenic plants can also be prepared by crossing a first plant havinga recombinant DNA construct with a second plant lacking the construct.For example, a recombinant DNA designed for targeting the suppression ofa target gene can be introduced into a first plant line to produce atransgenic plant which can be crossed with a second plant line tointrogress the recombinant gene suppression DNA into the second plantline. The second plant line may already contain or be later transformedor bred with another transgenic line to contain one or more transgenesthat are different from the gene suppression construct beingintrogressed from the first plant line.

Without intending to be limited to any single embodiment, the nucleotidesequences of the present invention exhibit a wide variety of usefulness.For example, the sequences can be used to synthesize dsRNA moleculeseither in in vivo or in vitro systems selected for their ability tocause gene suppression and therefore pest inhibition and such moleculedcan be expressed in a transgenic plant, applied to the rhizosphere orbiosphere of a plant, or applied in a seed coating or treatment forcausing gene suppression in a pest. The sequences can be used in kitsincorporating methods for detecting DNA, RNA, or siRNA's in a seed,plant, tissue, biological sample, meal, oil, flour, food product,commodity product, and the like. The sequences can be used for detectingthe presence of a homologous sequence in a biological sample. Thesequences can be used to construct a dsRNA for suppression of a targetgene and can be linked to an RNA segment that binds specifically to oneor more receptor molecules, bringing the dsRNA segment into closeproximity to a membrane surface, and increasing its likelihood of beingtaken up by a cell which contains a gene that is targeted forsuppression by the dsRNA.

In one embodiment, a nucleotide sequence of the present invention can berecorded on one or more computer readable media. As used herein,“computer readable media” refers to any tangible medium of expressionthat can be read and accessed directly by a computer. Such mediainclude, but are not limited to: magnetic storage media, such as floppydiscs, hard discs, and magnetic tape. Optical storage media includephysical storage devices such as compact diskettes. Electrical storagemedia include random access and read only memory devices (RAM and ROM).A skilled artisan can readily appreciate that any of the presently knowncomputer readable mediums can be used to create a manufacture comprisinga computer readable medium having recorded thereon one or more sequencesof the present invention. These devices can be accessed with a computerand used to perform a search and comparison of any other sequence oflike composition (i.e., nucleotide sequences compared to nucleotidesequences, amino acid sequences compared to amino acid sequences, etc)to determine whether and to what extent a similarity or identity ispresent between the sequences being compared.

As used herein, “recorded” refers to a process for storing informationon computer readable medium. A skilled artisan can readily adopt any ofthe presently known methods for recording information on computerreadable medium to generate media comprising the nucleotide sequenceinformation of the present invention. A variety of data storagestructures are available for creating a computer readable medium havingrecorded thereon one or more sequences of the present invention. Thechoice of the data storage structure will generally be based on themeans chosen to access the stored information. In addition, a variety ofdata processor programs and formats can be used to store the sequenceinformation of the present invention on computer readable medium. Thesequence information can be represented in a word processing text file,formatted in commercially available software such as WordPerfect,Microsoft Word, or shareware such as Linux, or represented in the formof an ASCII text file, stored in a database application, such as DB2,Sybase, Oracle, or the like. The skilled artisan can readily adapt anynumber of data processor structuring formats (e.g. text file ordatabase) in order to obtain computer readable medium having recordedthereon the nucleotide sequence information of the present invention.

Computer software is publicly available which allows a skilled artisanto access sequence information provided in a computer readable medium.Software which implements the BLAST (Altschul et al., J. Mol. Biol. 215:403-410, 1990) and BLAZE (Brutlag, et al., Comp. Chem. 17: 203-207,1993) search algorithms on a Sybase system can be used to identify openreading frames (ORFs) within sequences such as the EST's that areprovided herein and that contain homology to ORFs or proteins from otherorganisms. Such ORFs are protein-encoding fragments within the sequencesof the present invention and are useful in producing commerciallyimportant proteins such as enzymes used in amino acid biosynthesis,metabolism, transcription, translation, RNA processing, nucleic acid anda protein degradation, protein modification, and DNA replication,restriction, modification, recombination, and repair.

EXAMPLES Example 1

This Example illustrates the construction and DNA sequence analysis ofSCN genome libraries.

SCN genomic DNA libraries (LIB5513, LIB 5514, LIB5519, and LIB5520) wereconstructed from SCN strain OP25 genomic DNA (Dong et al., 1997,Genetics 146:1311-1318). The libraries were generated by ligatingsize-selected physically sheared DNA into the high copy number plasmidpUC18 and the resulting ligation mixture was transformed into E. coli byelectroporation. 10 micrograms of SCN genomic DNA were resuspended into30 microliters TE buffer. The DNA was sheared by sonication. Thesonicated DNA was 5′ end-repaired using T4 DNA polymerase (New EnglandBioLabs) and 10 mM dNTP's in a total reaction volume of 35 microlitersand equilibrated to 1× ligation buffer (New England BioLabs). 3′overhangs were repaired by treatment with T4 polynucleotide kinase. Themixture was incubated at 15° C. for 20 minutes, and transferred to 65°C. for 15 minutes to inactivate the kinase and polymerase, and incubatedat room temperature for an additional 10 minutes. The repaired DNA wassize fractionated by electrophoresis in a 0.7% agarose gel adjacent to a1 Kb molecular weight marker at 80 volts for two hours in TBE buffer.The 2-4 KB and 4-8 KB DNA fragments were excised from the agarose geland transferred into microcentrifuge tubes. The size-selected DNAfragments were isolated from the agarose gel and a second round of sizeselection was performed to eliminate small DNA fragments co-migratingwith the selected range in first gel fractionation. Approximately 100nanograms of the size-selected repaired DNA was inserted by ligationinto a pUC18-HincII digested vector (molar ratio of 5 to 1). The ligatedDNA was transformed into E. coli DH10B cells by electroporation andplanted to LB plates containing 100 micrograms per milliliter ampicillinand incubated for 18-24 hours at 37 C. Several colonies that arose afterincubation were randomly selected. The colonies were tested to determinethe average DNA insert size and the average number of colonies in thelibrary that appeared to contain no inserted recombinant DNA. Fourlibraries were constructed. The average insert size in library LIB5513was 2-4 KB, in library LIB5514 was 4-8 KB, in library LIB5519 was 2-4KB, and in library LIB5520 was 4-8 KB. Samples of each library werecollected and combined together and deposited with the American TypeCulture Collection (ATCC) at Rockville Md., USA on Feb. 15, 2005. Thecombined library was submitted to ATCC, designated asLIB5513_(—)14_(—)19_(—)20, and the ATCC has assigned the patent depositnumber PTA-6583 to the deposited material.

The cells of the libraries were then plated on large bioassay platescontaining Luria Broth (Difco) supplemented with 100microgram/milliliter carbenicillin (ICN Biomedical), 64microgram/milliliter IPTG (Shelton Scientific) and 80microgram/milliliter X-Gal (Shelton Scientific). Individual bluetransformants were then picked into 1.2 ml Terrific Broth (Difco)supplemented with 125 μg/ml Ampicillin (Calbiochem) in 96 deep-wellboxes by Genetix Q-bot. The boxes were incubated for 21 hours at 37° C.,each well archived to individual wells in 384-well glycerol plates, andthen pelleted and stored at −20 C.

Alkaline lysis DNA extraction was performed on samples of pelletedclones using a QUIAGEN bead based platform on an automated roboticpreparation system. Eluted DNA was stored for sequencing at 4° C. in a96-well COSTAR plate. Two microliters of the DNA solution was thentransferred into a 384 well microtiter plate (AXYGEN) using a HamiltonMPH96 Pipetting Robot. The pipetted DNA was then denatured for 5 minutesat 95° C., and two microliters of Big Dye Reaction Mix (Big DyeTerminators v3.0, 3.2 pmol sequencing primer, 1×TNK, and 0.5M MgCl₂) wasthen added to the denatured DNA using a Hamilton MPH96 Pipetting Robot.Each clone was sequenced using M13 forward and reverse primers in a PCRsequencing reaction using the conditions as follows: 95° C. for 5seconds, 45° C. for 5 seconds, 60° C. for 2 minutes 30 seconds for atotal of 25 cycles. The sequencing reactions were ethanol precipitatedand re-suspended in water and loaded onto an ABI 3730×l SequencingAnalyzer (APPLIED BIOSYSTEMS) to generate sequence trace data for eachsample. Approximately 400,000 sequencing reads were generated from thefour Heterodera glycines genome libraries.

Example 2

This Example illustrates the analysis, characterization, and assembly ofthe sequences obtained from DNA sequence analysis of the SCN genomelibraries.

The sequence trace data was converted to sequence and quality files andstandard quality control procedures were applied through the use of theblock 0/1 pipelines. Quality control procedures included sequencequality trimming, sequence identity, cloning sequence removal, andcontamination identification and removal. The results of the generalsequencing pre-processing steps were stored in the sequence databaseSeqDB. Data passing quality controls were retrieved for inclusion in theassembly step. The dataset to be assembled consisted of 338,266 sequencereads that passed the block 0/1 process, represented by an initialoutput of 404,372 sequencing reads that were submitted to the block 0/1process. A file of clone pair constraints was produced on the basis ofknown clone naming conventions and library construction details (insertsize range). The clone pair constraint file consisted of 159,389pairwise entries. Fasta, quality, and constraint files were used asinput to the PCAP program (Version Date: Sep. 3, 2004, Huang, X., Wang,J., Aluru, S., Yang, S.-P. and Hillier, L. (2003): PCAP: A Whole-GenomeAssembly Program. Genome Research, 13: 2164-2170), and the sequenceswere assembled. 45,568 output genomic contig sequences were producedwhose sum length represented about 80.8 Million bases. These contigsequences are represented by the sequences as set forth in SEQ IDNO:1-SEQ ID NO:45568 and were subsequently used as input sequences todefine generic regions of the SCN genome sequence corresponding topredicted coding sequences (referred to herein as vcDNA's or virtualcomplementary DNA's) and predicted promoter and intronic sequences.

SCN expressed sequences were collected from public sources and used tocompare the genomic sequences identified herein as well as to identifyunique sequences not present in any known public database set. Publicsequences were collected into a file which contained non-identicalcontigs from (1) the Genome Sequencing Center at Washington Universityin St. Louis, Mo., USA (Nemagene clusters; McCarter et al., 2003, J.Nematology 35:465-469), (2) Parkinson contigs (Nembase clusters;Parkinson et al., 2004, Nature Genetics 36:1259-1267), (3) EST's inGenBank not contained in contigs (singletons), and (4) nucleotidesequences representing non-EST DNA sequences in GenBank (e.g., mRNAs).These sequences were compiled into and referred to herein as anessential gene sequence list corresponding to sequences as set forthherein at SEQ ID NO:47644-SEQ ID NO:50775.

Gene finding results were consolidated in a relational database in sucha way that each predicted gene is represented by a set of coordinatesthat define the position of all segments of the gene on the genomic DNAcontig (gDNA). The genes are described herein, and in particular in theFeature Fields of the Sequence Listing with reference to the nucleotidepositions of each vcDNA giving rise to an amino acid sequence and in theamino acid sequence SEQ ID NO's as nucleotide sequences corresponding toportions of vcDNA's encoding the amino acid sequence. Sequences betweenthe indicated protein coding portions correspond to predicted intronicsequences. Other sequence segments that are represented at least in thegenomic sequences set forth in SEQ ID NO:1-SEQ ID NO:45568 include butare not limited to peptide-encoding segments such as initial exon,internal exon, terminal exon, or single exon and the like, andnon-coding segments including promoter regions, transcription initiationsequences, transcription termination sequences, and polyadenylationsignal sequences, and the like. Often the same position within a gDNAcontig is predicted to contain a gene by more than one gene findingprogram. Thus, in order to prepare a library of genes where eachposition (locus) of the genome is represented by a single gene, severaldifferent gene prediction methods were applied and the results wereconsolidated according to the following algorithm.

1. For each gDNA contig, all clusters of overlapping genes were defined.Each cluster was assumed to correspond to a single gene. The cluster wasdefined as a set of sequences located on the same DNA strand and eitherpredicted to overlap based on nucleotide sequence identity along thelengths of the sequences or predicted to be located closer than 50nucleotides from each other. Only peptide-encoding segments wereconsidered when defining a cluster. The start and end positions of thecluster define the maximal dimension of the gene.

2. For each cluster the preferred gene was selected, which representsthis locus in the library. The selection algorithm is described asfollows:

(a) All genes in a cluster were ranked by the gene-prediction methodthat produced them. The ranking by the different methods was intended todescribe assumed accuracy of the method in predicting genes. The rankingwas ordered arbitrarily using FgeneSH, Genemark.hmm and AAT/NAP dataresults. The AAT/GAP results were not ranked at all, but were used onlyif there were no other prediction for the locus, i.e. cluster containedonly gene(s) predicted by AAT/GAP.

(b) The highest ranking gene was selected unless there were severalequally ranked genes (i.e. predicted by the same method) or the clustercoverage by this gene was below 60%. The cluster coverage was computedas the ratio of the gene length to the length of the cluster (maximaldimension of the gene).

(c) For equally ranked genes, the gene with highest cluster coverage wasselected.

(d) If the cluster coverage for the best-ranking gene was below 60%, thelower ranking genes were considered (in the ranking order) and the firstone providing a gain in cluster coverage of at least 10% was selected.

(e) If a cluster contained only AAT/GAP-predicted genes—the one with thebest cluster coverage was selected.

(f) For all other clusters, additional filtering was completed—onlysequences that exhibited a translation product of at least 16 aminoacids in length were selected.

(g) If a cluster contained only Genemark.hmm-predicted genes—no gene wasselected and the locus was assumed not to contain any gene.

The method described above resulted in a list of “preferred” genes. Theactual DNA sequence for each of these genes was prepared by extracting asubsequence (region of a sequence) of a gDNA contig which correspondedto the coordinates of the gene. The sequences prepared contained allpredicted exons and introns of the gene. In the case of MT/GAP andFgeneSH genes they also may contain regions between transcription andtranslation initiation sequences, and between translation terminationand polyadenylation sequences.

The three gene-predicting programs—FgeneSH, Genemark.hmm and AAT/NAP—inaddition to predicting positions of genes, also predict sequences of thetranslation product, if any. Thus, the “preferred” genes and theirtranslated peptide sequences were simultaneously predicted by thesemethods. Virtual cDNA sequences (vcDNA) were prepared from genes derivedonly from AAT/GAP prediction results by extracting regions of genomicDNA (gDNA) corresponding to the predicted exons and splicing themtogether. These virtual cDNA sequences were translated using atranslator tool. The feature fields of indicated peptide SEQ ID NO'sidentify genomic contig sequence positions (for example,Contig_ID=SeqID_XXX) for the coding sequence contained therein.Additional information provided in the feature fields includes theidentity of SCN-specific sequences, the nucleotide positions of thesesequences in the vcDNA sequence, homology to existing sequences inpublicly available databases, a numerical evaluation of the extent ofthe homology, and the predicted function if any associated with thepeptide.

The vcDNA sequences were used to identify sequences corresponding to SCNspecific promoter sequences using the following procedure:

1. For each gene predicted by either of the FgeneSH, Genemark.hmm orAAT/NAP prediction algorithms, the position of the firstpeptide-encoding segment was used as the reference point for sequenceextraction. The sequence of the gDNA contig which starts 1000nucleotides upstream and ends 2 nucleotides downstream of the referencepoint was extracted.

2. The resulting sequence of the upstream region was shorter if the genewas located closer than 1000 nucleotides to the end of the genomiccontig. If there was another gene located upstream and predicted by oneof these methods—FgeneSH, Genemark.hmm or AAT/NAP, the upstream regionwas shortened (truncated) so that it did not overlap with the closestpeptide-encoding segment of that gene. If the resulting sequence wasshorter than 50 nucleotides, it was not included as a promoter sequencein the library of promoter sequences.

3. If the resulting sequence did not end with the translation initiationcodon ATG, i.e., the predicted gene was not N-terminal complete—then thesequence was not included as a promoter sequence in the library ofpromoter sequences.

4. Sequences located upstream of AAT/GAP-predicted genes were notincluded in the library of promoter sequences since this program did notpredict a translation initiation position and in certain situationsplaced the predicted gene on the wrong strand of a gDNA contig.

Example 3

This Example illustrates the annotation of predicted SCN genes.

Two methodologies were used to provide annotations of the predictedHeterodera glycines (SCN) peptides, including Gene ontology (GO) andSmartBlast. Both GO and SmartBlast procedures were developed throughhomology-based sequence searches. In GO procedures, the peptidesequences from SCN peptides were used to BLAST against a proteinsequence database, for example, the non-redundant protein (nr-aa)database maintained by the National Center for Biotechnology Informationas part of GenBank. The highly conserved homologues of nr-aa from avariety of species were further selected with a minimal E value of1E-08. The selected SCN homologues were subjected to the sequence matchwith a protein sequence database (GO proteins from GO Ontologyconsortium). Finally, three categories according to the GO Ontologyconsortium (molecular function; biological process; and cellularcomponent) were used to annotate the SCN sequences. In SmartBlastprocedures, the peptide sequences from SCN peptides were also used toblast against the non-redundant protein as described above. Thehomologues were also selected with a minimal E value of 1E-08. Thosehomologues were subjected to filtering using some non-meaningful words,such as “putative”. The best meaningful homologues were used for SCNsequence annotation. The conditions used to provide the homologannotation and the best hit with respect to any predicted SCN geneproduct were referred to in one or more of the feature fields for eachof the SCN protein sequences selected from the group consisting of SEQID NO:119146-SEQ ID NO:121220, and were further identified as tomolecular function, enzyme activity, cellular component and biologicalprocess. Genes characterized as encoding proteins that may be essentialfor survival based on the proteins' relationship at least to one or moreC. elegans homologs and the phenotype of the knockout of the C. eleganshomolog were further identified in one or more of the feature fields ofeach of the peptide sequences. The phenotype observed, abbreviations foreach, and the standard nomenclature assigned for each with reference tothat same phenotype and nomenclature in C. elegans was identifiedpreviously hereinabove.

Example 4

This Example illustrates a method for screening the SCN genomesequences, the predicted vcDNA sequences, and the predicted amino acidsequence encoded therefrom, against other sequences and selectingsequences unique to SCN.

The sequences disclosed herein can be used in a method to provide a DNAconstruct for expression of a dsRNA that is effective for silencing of agene in a soybean cyst nematode or other plant pest by expressing suchDNA construct in the cells of a transgenic plant and providing the plantin the diet of the nematode or pest. DNA sequences can be selected fromthe sequences of the present invention that are useful in achievingdsRNA-mediated gene silencing by selecting from a target gene a DNAsequence consisting of at least from about 17 to about 21 or morecontiguous nucleotides. Effective short interfering RNA's (siRNAs) forgene repression are normally from about 21 to about 23-nt longdouble-stranded RNA duplexes. These siRNA's are known to incorporateinto the RNA-inducing silencing complex (RISC). Once unwound, thesingle-stranded antisense strand guides RISC to the target mRNA, andinduces the cleavage of the target messages, resulting in translationalinhibition (Dykxhoorn, et al. Molecular Cell Biology, 4:457-467, 2003).Plant siRNA sequences have been characterized generally as contiguousnucleotide sequences of from about 24 nucleotides in length (Tang, 2003,Genes & Development 17:49-63). It is preferred that interfering RNAmolecules are selected from the sequences as set forth in SEQ IDNO:1-SEQ ID NO:97729 to limit the un-intended “off-target” effect ofgene repression by limiting the potential base-pairing with unintendedtargets of the host or other non-target organisms.

Example 5

This example illustrates the identification of SCN genes that can betargeted for suppression using the nucleotide sequences of the presentinvention.

A comparison of the SCN genes was made to the genes identified in C.elegans for which knockouts have been previously identified to result inan observable phenotype. RNAi phenotypes include maternal sterile,embryonic lethal and a variety of postembryonic phenotypes. Therelationship between C. elegans knockout phenotypes and their proteinsequences were obtained. These protein sequences were then compared tothe protein sequences translated from the SCN genomic sequences of thepresent invention.

A BLAST searchable “All Protein Database” was constructed, which wascomposed of genome-wise SCN peptides and C. elegans proteins. Areciprocal blast procedure was used to identify the possible orthologuesof C. elegans for each SCN peptide.

The All Protein Database was queried using protein sequences of the SCNpeptides using the “blastp” algorithm with an E-value cutoff of 1e-8. Upto 1000 hits were retained for each SCN peptide used in the query, andseparated by organism names, either C. elegans or SCN. For C. elegans, alist was retained for the hits with SCN sequences exhibiting a moresignificant E-value than the best hit of the organism. The list containslikely duplicated SCN genes, and was referred to as a Core List. Anotherlist was retained for all the hits from each organism, sorted by theE-value, and was referred to as a Hit List. The hit was identified as anorthologue of the query sequence if it was within the Core List.

Knockout phenotypes of SCN were inferred according to the degree ofevolutionary relationship determined to exist between SCN and C. elegansproteins with reference to the knockout phenotypes of C. elegans genes,referred to herein above. For example, C. elegans C37H5.8 corresponds toa HSP-6 protein, and a knockout of this gene has been associated withthe observed phenotypes of embryonic lethality and larval arrest.Orthologue identification from the above query indicated that an SCNamino acid sequence corresponding to SEQ ID NO:119310 is an orthologueof C37H5.8. Therefore, it is believed that because of the relationshipof the SCN sequence corresponding to SEQ ID NO:119310 to the C. elegansorthologue C37H5.8, suppression of the SCN gene corresponding to SCNvcDNA sequence as set forth at SEQ ID NO:45733 encoding the C37H5.8orthologue at SEQ ID NO:119310 would be expected to result in anobservable phenotype corresponding to embryonic lethal and/or larvaarrest in SCN. SCN genes have been categorized based on theirrelationship to identifiable orthologues with genes or sequences inother organisms and some are further identified as essential genes. Suchinformation has been provided for each amino acid sequence predictedfrom the vcDNA sequences and is listed in the feature fields for eachsequence in the sequence listing. The feature field in the sequencelisting has been used to identify important features of the DNAmolecules of the present invention. A DNA construct that contains targetsequences from multiple SCN essential genes can be constructed toexpress a chimeric dsRNA molecule that affects more than one SCN gene.This aspect of the present invention reduces the possibility ofselecting for a population of SCN that is unaffected by the dsRNAmolecule.

SCN genes were grouped into Pfam protein families. Pfam is acomprehensive database of protein domain families, based on multiplealignments of protein domains or conserved protein regions (NucleicAcids Research 2004 32:D138-D141; Proteins 28:405-420, 1997.). Peptidesequences of a subset of SCN genes have been matched to Pfam entrieswith HMMPFAM program, with an expectation value cutoff of 0.1(Biological sequence analysis: probabilistic models of proteins andnucleic acids, Cambridge University Press, 1998.) The subset included5207 SCN protein sequences that were analyzed by this method, and 3397of the 5207 protein sequences were grouped into 909 families, as setforth in Table 1.

In order to target a protein gene family for suppression with a singledsRNA molecule, it may be necessary to identify conserved DNA sequenceregions among protein gene family members. After the amino acid sequencetranslations from the virtual cDNA sequences were grouped into proteinfamilies, the conserved sequence regions were identified throughmultiple sequence alignment of the DNA sequences of the family members.For example, using the program CLSUTALW (ref. Nucleic Acids Res.22:4673-4680), member sequences of a Pfam group can be aligned. Oneexample is illustrated by an alignment of SEQ ID NO's representative ofthe nucleotide sequences encoding the protein family members in theMRP_L47 family, a mitochondrial ribosomal protein family, correspondingto SEQ ID NO:49132 (HG02471), SEQ ID NO:50709 (HGC08009), and SEQ IDNO:46538 (HG2_(—)27019.C1.o1.np). An alignment of these three sequencesallows the identification of conserved contiguous residues present ineach of the three sequences. The conserved segments consisting of atleast 21 contiguous nucleotides are representative of the preferredpolynucleotide regions for expression in a double stranded RNA sequencefor use in targeting the suppression of each member of the entire genefamily. The comparison of protein sequences of family members identifiedand grouped in Table 1 enables the identification of relatedpolynucleotide regions common among the family members by locating thecorresponding cDNA and genomic contig sequences identified in thefeature field of the Sequence Listing. Using this method of comparison,the protein sequences of family members identified in Table 1 and in SEQID NO:119146-SEQ ID NO:124352 allows the skilled artisan to identify therelated polynucleotide regions that are common among the family membersby locating the corresponding virtual cDNA (vcDNA) and genomiccontiguous sequences as set forth in SEQ ID NO:1-SEQ ID NO:119145. Thesesequences can then be used in a DNA construct to express a dsRNAmolecule in plant cells that is directed to the suppression of one ormore genes in any of one or more plant pests. These polynucleotides canthen be used in a DNA construct to express a homologous dsRNA moleculein plant cells.

TABLE 1 SCN GENE FAMILIES and ANNOTATIONS SCN gene families Gene nameSCN gene family members Protein annotation bZIP_1 SeqID_122625SeqID_119811 bZIP transcription factor SeqID_124042 SeqID_124331SeqID_121799 Mito_carr SeqID_121264 SeqID_122047 Mitochondrial carrierprotein SeqID_122051 SeqID_122111 SeqID_122259 SeqID_122457 SeqID_122504SeqID_122594 SeqID_121347 SeqID_121360 SeqID_121361 SeqID_122965SeqID_122993 SeqID_123057 SeqID_123135 SeqID_123186 SeqID_123212SeqID_123240 SeqID_123284 SeqID_123348 SeqID_123379 SeqID_123391SeqID_123930 SeqID_119420 SeqID_119539 SeqID_120127 SeqID_120194SeqID_120280 SeqID_121563 SeqID_121585 SeqID_120988 SeqID_121115SeqID_123589 SeqID_123608 SeqID_123626 SeqID_123732 SeqID_123814SeqID_123863 SeqID_121678 SeqID_124088 SeqID_124151 SeqID_124350SeqID_121751 SeqID_121752 SeqID_121870 bZIP_2 SeqID_121260 SeqID_122546Basic region leucine zipper SeqID_122625 SeqID_119811 SeqID_124042SeqID_121799 Sec7 SeqID_120412 Sec7 domain MutS_IV SeqID_119581 MutSfamily domain IV CtaG_Cox11 SeqID_119768 Cytochrome c oxidase assemblyprotein Cta Synaptobrevin SeqID_122614 SeqID_122853 SynaptobrevinSeqID_123133 SeqID_123159 SeqID_123266 SeqID_120966 SeqID_121065SeqID_123723 Fer2 SeqID_122093 SeqID_122357 2Fe—2S iron-sulfur clusterbinding SeqID_119370 SeqID_124223 domain SeqID_124309 WD40 SeqID_121225SeqID_121265 WD domain, G-beta repeat SeqID_122030 SeqID_122050SeqID_122140 SeqID_122192 SeqID_122212 SeqID_122243 SeqID_122288SeqID_122350 SeqID_122349 SeqID_122434 SeqID_122449 SeqID_122509SeqID_122514 SeqID_122539 SeqID_122547 SeqID_121310 SeqID_121337SeqID_121375 SeqID_122883 SeqID_122959 SeqID_122996 SeqID_123045SeqID_123090 SeqID_123107 SeqID_123122 SeqID_123198 SeqID_123318SeqID_123329 SeqID_121391 SeqID_119248 SeqID_119272 SeqID_119279SeqID_119343 SeqID_119431 SeqID_119538 SeqID_119627 SeqID_119670SeqID_119769 SeqID_119826 SeqID_119840 SeqID_119913 SeqID_119990SeqID_121523 SeqID_120032 SeqID_120112 SeqID_120134 SeqID_120235SeqID_120302 SeqID_120344 SeqID_120362 SeqID_120457 SeqID_120458SeqID_120507 SeqID_120508 SeqID_120576 SeqID_120747 SeqID_120826SeqID_120866 SeqID_120885 SeqID_120945 SeqID_120994 SeqID_120997SeqID_121007 SeqID_121094 SeqID_121096 SeqID_121116 SeqID_121184SeqID_121199 SeqID_123563 SeqID_123612 SeqID_123643 SeqID_123841SeqID_123880 SeqID_121640 SeqID_123965 SeqID_124104 SeqID_124118SeqID_124150 SeqID_124186 SeqID_124334 SeqID_121790 SeqID_121804SeqID_121816 SeqID_121840 SeqID_121844 Skp1 SeqID_121262 SeqID_122339Skp1 family, dimerisation domain SeqID_121318 SeqID_119848 Fer4SeqID_122076 SeqID_122198 4Fe—4S binding domain SeqID_120449SeqID_120455 SeqID_124131 Enolase_C SeqID_122156 SeqID_123621 Enolase,C-terminal TIM barrel domain Mucin SeqID_120313 Mucin-like glycoproteinNHL SeqID_121326 NHL repeat FAT SeqID_122973 SeqID_120212 FAT domainIso_dh SeqID_122456 SeqID_120630 Isocitrate/isopropylmalate SeqID_123561dehydrogenase APH SeqID_122384 SeqID_122682 Phosphotransferase enzymeSeqID_121416 SeqID_119410 family SeqID_119453 SeqID_119764 SeqID_121622SeqID_120808 SeqID_124140 SeqID_121803 Suf SeqID_122937 Suppressor offorked protein (Suf) Enolase_N SeqID_122156 SeqID_123621 Enolase,N-terminal domain Ldh_1_C SeqID_122492 SeqID_120389 lactate/malatedehydrogenase, SeqID_123782 alpha/beta C-t HMG_CoA_synt SeqID_122417SeqID_120809 Hydroxymethylglutaryl-coenzyme SeqID_121837 A synthas PLDcSeqID_119325 SeqID_120704 Phospholipase D Active site motifGlycos_transf_1 SeqID_121164 Glycosyl transferases group 1Dala_Dala_lig_C SeqID_119260 D-ala D-ala ligase C-terminus Kunitz_BPTISeqID_122170 SeqID_119622 Kunitz/Bovine pancreatic trypsin SeqID_119623inhibito Nuc_sug_transp SeqID_119240 Nucleotide-sugar transporter cobWSeqID_122575 CobW/HypB/UreG, nucleotide- binding domain L15 SeqID_121270SeqID_122374 Ribosomal protein L15 SeqID_123713 Ldh_1_N SeqID_122492SeqID_121371 lactate/malate dehydrogenase, SeqID_122868 SeqID_120389 NADbinding do SeqID_123782 CLP_protease SeqID_122193 SeqID_122948 Clpprotease SeqID_124275 SeqID_120143 SeqID_123619 SeqID_121952 HEAT_PBSSeqID_122486 SeqID_120345 PBS lyase HEAT-like repeat SeqID_123693 NICSeqID_121904 Nucleoporin interacting component SecY SeqID_122609SeqID_123948 eubacterial secY protein PCI SeqID_122070 SeqID_122383 PCIdomain SeqID_119309 SeqID_119633 SeqID_120088 SeqID_120892 SeqID_123711SeqID_123975 SeqID_121783 Abhydro_lipase SeqID_119405 ab-hydrolaseassociated lipase region Mra1 SeqID_123459 Suppressor Mra1 tRNA-synt_1bSeqID_123017 SeqID_120260 tRNA synthetases class I (W and SeqID_121648Y) NIF SeqID_121832 NLI interacting factor-like phosphatase Laminin_G_1SeqID_120488 Laminin G domain tRNA-synt_1c SeqID_122217 SeqID_122354tRNA synthetases class I (E and SeqID_123425 SeqID_119559 Q), cataSeqID_120889 SeqID_123797 Laminin_G_2 SeqID_120488 Laminin G domaintRNA-synt_1e SeqID_119570 SeqID_119679 tRNA synthetases class I (C)catalytic d Acyl-CoA_dh_M SeqID_123412 SeqID_119407 Acyl-CoAdehydrogenase, middle SeqID_119554 domain Mak16 SeqID_123061SeqID_120231 Mak16 protein SeqID_120297 Clp1 SeqID_120900 Pre-mRNAcleavage complex II protein Clp1 Guanylate_cyc SeqID_119460 Adenylateand Guanylate cyclase catalyst Acyl-CoA_dh_N SeqID_122997 SeqID_123412Acyl-CoA dehydrogenase, N- SeqID_119407 SeqID_119740 terminal domaSeqID_124296 RNase_PH_C SeqID_122716 SeqID_120146 3′ exoribonucleasefamily, domain 2 MoeZ_MoeB SeqID_120491 MoeZ/MoeB domain Chitin_synth_2SeqID_120016 Chitin synthase PAP_central SeqID_12343 SeqID_119183Poly(A) polymerase central domain SeqID_120181 SeqID_120989 SeqID_121025SeqID_123810 rve SeqID_119251 SeqID_119810 Integrase core domain RED_NSeqID_120411 RED-like protein N-terminal region Ank SeqID_123070SeqID_119179 Ankyrin repeat SeqID_119568 SeqID_119583 SeqID_119584SeqID_119680 SeqID_119792 SeqID_119874 SeqID_120226 SeqID_121617 CKSSeqID_122543 SeqID_121066 Cyclin-dependent kinase regulatory subunitBand_7 SeqID_122693 SeqID_119534 SPFH domain/Band 7 family SeqID_123829PAF-AH_p_II SeqID_120737 SeqID_121196 Platelet-activating factoracetylhydrolas SF-assemblin SeqID_120078 SF-assemblin/beta giardinRibosomal_S24e SeqID_122807 SeqID_120291 Ribosomal protein S24eSeqID_124204 SeqID_124210 SeqID_121954 Ribosomal_S17e SeqID_122394SeqID_119329 Ribosomal S17 SeqID_123964 Sof1 SeqID_122192 SeqID_120865Sof1-like domain SeqID_120866 SeqID_124334 LrgB SeqID_120612 LrgB-likefamily DUF1650 SeqID_122945 SeqID_123077 Protein of unknown functionSeqID_120203 (DUF1650) Laminin_EGF SeqID_121392 SeqID_119297 LamininEGF-like (Domains III and SeqID_119335 SeqID_119714 V) SeqID_119816SeqID_120849 SeqID_120911 TruB_N SeqID_119847 TruB familypseudouridylate synthase (N term tRNA-synt_2b SeqID_122133 SeqID_122759tRNA synthetase class II core SeqID_119832 SeqID_120346 domain (G,SeqID_120388 SeqID_120642 SeqID_121608 SeqID_121085 SeqID_121645SeqID_124158 SeqID_124247 tRNA-synt_2c SeqID_120391 tRNA synthetasesclass II (A) Innexin SeqID_123073 SeqID_123480 Innexin SeqID_120601tRNA-synt_2d SeqID_122320 SeqID_119409 tRNA synthetases class II coreSeqID_121659 SeqID_124019 domain (F MFS_1 SeqID_119430 SeqID_119607Major Facilitator Superfamily SeqID_120612 Cyt-b5 SeqID_122136SeqID_122505 Cytochrome b5-like Heme/Steroid SeqID_119879 SeqID_123602bindin MAM33 SeqID_122105 SeqID_123650 Mitochondrial glycoprotein ZZSeqID_120217 Zinc finger, ZZ type Dpy-30 SeqID_122847 SeqID_121204Dpy-30 motif K_tetra SeqID_122369 SeqID_119930 K+ channeltetramerisation domain SeqID_120530 Tim44 SeqID_123264 SeqID_119402Tim44-like domain Mtap_PNP SeqID_121534 SeqID_121548 Phosphorylasefamily 2 SeqID_121569 PDZ SeqID_119401 SeqID_120527 PDZ domain (Alsoknown as DHR or GLGF) CHCH SeqID_123527 CHCH domain Ribonuc_red_smSeqID_122881 SeqID_120548 Ribonucleotide reductase, small SeqID_121571SeqID_121595 chain Pro_isomerase SeqID_12368 SeqID_122436 Cyclophilintype peptidyl-prolyl cis- SeqID_12839 SeqID_123273 tr SeqID_121384SeqID_119354 SeqID_120150 SeqID_123750 SeqID_123891 SeqID_121707SeqID_124224 DIX SeqID_123325 DIX domain Hydrolase SeqID_119435 haloaciddehalogenase-like hydrolase Peptidase_C1 SeqID_121248 SeqID_121247Papain family cysteine protease SeqID_121267 SeqID_122029 SeqID_122165SeqID_122555 SeqID_122593 SeqID_123403 SeqID_123446 SeqID_121447SeqID_119313 SeqID_123567 SeqID_123581 SeqID_123578 SeqID_121782SeqID_121853 SeqID_121879 Peptidase_C2 SeqID_123259 SeqID_120192 Calpainfamily cysteine protease SeqID_120374 E1-E2_ATPase SeqID_122184SeqID_119574 E1-E2 ATPase SeqID_119844 SeqID_120162 Peptidase_M13SeqID_122218 Peptidase family M13 FLYWCH SeqID_119876 SeqID_120405FLYWCH zinc finger domain Peptidase_M14 SeqID_122059 SeqID_122194 Zinccarboxypeptidase SeqID_119546 SeqID_123807 SeqID_124007 Sec62SeqID_121454 Translocation protein Sec62 Sec63 SeqID_120914 Sec63 domainPeptidase_M16 SeqID_123296 SeqID_119755 Insulinase (Peptidase familyM16) SeqID_124192 EGF SeqID_119714 SeqID_119985 EGF-like domainSeqID_120013 SeqID_120488 SeqID_120732 SeqID_120849 Ribonuc_red_IgCSeqID_121385 Ribonucleotide reductase, barrel doma UPF0027 SeqID_121387Uncharacterized protein family UPF0027 APC10 SeqID_122598 SeqID_119693Anaphase-promoting complex, SeqID_123584 subunit 10 (APC1 Integrin_alphaSeqID_119890 Integrin alpha cytoplasmic region Dynein_heavy SeqID_120763SeqID_120973 Dynein heavy chain Chromo SeqID_119766 SeqID_120561‘chromo’ (CHRromatin Organisation MOdifier) Surp SeqID_122195SeqID_120133 Surp module SeqID_124100 SeqID_124099 Lipase_GDSLSeqID_122975 SeqID_120739 GDSL-like Lipase/Acylhydrolase ASCSeqID_119784 Amiloride-sensitive sodium channel F-actin_cap_ASeqID_123125 SeqID_119252 F-actin capping protein alpha subunitRibosomal_L2_C SeqID_122387 SeqID_122576 Ribosomal Proteins L2,C-terminal SeqID_119709 SeqID_123662 doma SeqID_124015 SeqID_121759SeqID_121891 TPR_1 SeqID_122178 SeqID_122894 Tetratricopeptide repeatSeqID_123022 SeqID_119202 SeqID_119274 SeqID_119429 SeqID_119528SeqID_119687 SeqID_120074 SeqID_120605 SeqID_121168 SeqID_121175SeqID_123940 NTP_transf_2 SeqID_122343 SeqID_123810Nucleotidyltransferase domain TPR_2 SeqID_122178 SeqID_122894Tetratricopeptide repeat SeqID_123022 SeqID_119202 SeqID_119274SeqID_119429 SeqID_119528 SeqID_119687 SeqID_120074 SeqID_120591SeqID_120605 SeqID_120748 SeqID_120947 SeqID_121168 SeqID_121175SeqID_123940 TPR_4 SeqID_119202 Tetratricopeptide repeat COesteraseSeqID_122237 SeqID_119940 Carboxylesterase SeqID_120067 SeqID_120891SeqID_124012 TLE_N SeqID_123010 SeqID_119913 Groucho/TLE N-terminalQ-rich domain F-box SeqID_122666 SeqID_124313 F-box domain MRP-L47SeqID_122709 SeqID_120115 Mitochondrial 39-S ribosomal SeqID_124286protein L47 (MR Col_cuticle_N SeqID_121964 SeqID_121988 Nematode cuticlecollagen N- SeqID_121996 SeqID_122013 terminal do SeqID_122019SeqID_122027 SeqID_122291 SeqID_122304 SeqID_122313 SeqID_122472SeqID_122536 SeqID_122871 SeqID_122911 SeqID_122927 SeqID_122954SeqID_123029 SeqID_123087 SeqID_123176 SeqID_123223 SeqID_123423SeqID_119404 SeqID_119564 SeqID_119730 SeqID_119798 SeqID_120106SeqID_120175 SeqID_120474 SeqID_120705 SeqID_120750 SeqID_123663SeqID_121026 SeqID_121073 SeqID_123664 SeqID_123677 SeqID_123676SeqID_123769 SeqID_123783 SeqID_123827 SeqID_121953 SeqID_121957Na_H_Exchanger SeqID_120225 SeqID_120403 Sodium/hydrogen exchangerfamily ATP-synt_ab SeqID_122131 SeqID_122660 ATP synthase alpha/betafamily, SeqID_122734 SeqID_122935 nucleot SeqID_119160 SeqID_123656SeqID_123806 SeqID_123805 SeqID_124022 zf-B_box SeqID_120931 B-box zincfinger FMO-like SeqID_122732 SeqID_120410 Flavin-binding monooxygenase-SeqID_124057 like Ribosomal_S26e SeqID_122109 SeqID_120185 Ribosomalprotein S26e SeqID_123649 Ribosomal_S19e SeqID_122794 SeqID_123065Ribosomal protein S19e SeqID_123504 SeqID_124003 SeqID_124229SeqID_121859 Peptidase_C12 SeqID_122328 SeqID_120007 Ubiquitincarboxyl-terminal SeqID_121135 SeqID_124005 hydrolase, Peptidase_C13SeqID_122230 SeqID_121453 Peptidase C13 family SeqID_121467 SeqID_120219SeqID_120537 SeqID_121597 SeqID_124041 SeqID_121721 SeqID_121872SeqID_121883 Peptidase_C14 SeqID_123489 Caspase domain Peptidase_M24SeqID_122645 SeqID_124035 metallopeptidase family M24 SeqID_124033Ribosomal_L6e_N SeqID_122489 SeqID_123683 Ribosomal protein L6,N-terminal doma Paf1 SeqID_119976 Paf1 DUF1671 SeqID_123049 Protein ofunknown function (DUF1671) Complex1_51K SeqID_122912 SeqID_123104Respiratory-chain NADH SeqID_123995 dehydrogenase 51 Peptidase_M28SeqID_122327 SeqID_122621 Peptidase family M28 SeqID_119445 SeqID_123920PFK SeqID_120213 Phosphofructokinase DAGAT SeqID_123153 SeqID_120048Diacylglycerol acyltransferase SeqID_121202 RNA_pol_Rpb7_N SeqID_122067SeqID_121484 RNA polymerase Rpb7, N-terminal SeqID_121111 SeqID_123690domain DnaJ_C SeqID_121279 SeqID_122722 DnaJ C terminal regionSeqID_121405 SeqID_120879 SeqID_123882 zf-DNL SeqID_122584 SeqID_124310DNL zinc finger CNH SeqID_123483 SeqID_120283 CNH domain SeqID_121627DNA_ligase_A_M SeqID_123044 SeqID_119293 ATP dependent DNA ligase domainDNA_ligase_A_N SeqID_119293 DNA ligase N terminus LACT SeqID_119637Lecithin:cholesterol acyltransferase Ribosomal_L11_N SeqID_122466SeqID_122612 Ribosomal protein L11, N-terminal SeqID_122798 SeqID_120200dom SeqID_120321 SeqID_121603 SeqID_123787 SeqID_124218 Sec8_exocystSeqID_123021 Sec8 exocyst complex component specific Coatomer_ESeqID_122494 SeqID_120679 Coatomer epsilon subunit SeqID_123777 TT_ORF2SeqID_122828 SeqID_123933 TT viral ORF2 DNA_primase_S SeqID_122299SeqID_123968 DNA primase small subunit NACHT SeqID_122493 SeqID_119176NACHT domain Ribosomal_S27e SeqID_122380 SeqID_122513 Ribosomal proteinS27 SeqID_119700 SeqID_120329 SeqID_123636 SeqID_123738 Na_K-ATPaseSeqID_122309 SeqID_124043 Sodium/potassium ATPase beta chain TIP49SeqID_122138 SeqID_121376 TIP49 C-terminus SeqID_119157 SeqID_119793SeqID_123801 GNT-I SeqID_119837 GNT-I family Clathrin SeqID_119727Region in Clathrin and VPS MutS_V SeqID_119581 SeqID_119594 MutS domainV SeqID_120288 SeqID_120369 Acyl_CoA_thio SeqID_122480 SeqID_121327Acyl-CoA thioesterase SeqID_123795 SeqID_121650 SeqID_121919 PTPASeqID_121340 SeqID_123363 Phosphotyrosyl phosphate SeqID_119880activator (PTPA) pr UPF0113 SeqID_122707 Uncharacterised protein family(UPF0113) dsrm SeqID_119344 Double-stranded RNA binding motif Tom22SeqID_122077 SeqID_124129 Mitochondrial import receptor subunit Tom22EGF_CA SeqID_119714 SeqID_119985 Calcium binding EGF domain SeqID_120013SeqID_120732 lsy1 SeqID_122830 SeqID_120949 lsy1-like splicing familyELM2 SeqID_120201 SeqID_120354 ELM2 domain SeqID_121080 HA2 SeqID_122331SeqID_119298 Helicase associated domain (HA2) SeqID_120256 SeqID_120575SeqID_123546 SeqID_124253 RdRP SeqID_119588 RNA dependent RNA polymerase2-oxoacid_dh SeqID_119602 SeqID_121525 2-oxoacid dehydrogenasesSeqID_120978 acyltransferase UDPGP SeqID_123086 UTP--glucose-1-phosphateuridylyltransferase Arf SeqID_121288 SeqID_122204 ADP-ribosylationfactor family SeqID_122360 SeqID_122392 SeqID_122397 SeqID_122560SeqID_122575 SeqID_122579 SeqID_122619 SeqID_122671 SeqID_123132SeqID_123241 SeqID_123422 SeqID_119204 SeqID_119645 SeqID_119853SeqID_120523 SeqID_120531 SeqID_121613 SeqID_123568 SeqID_123726SeqID_123740 SeqID_123960 SeqID_123988 SeqID_123991 UDPGT SeqID_119517UDP-glucoronosyl and UDP- glucosyl transferas Cystatin SeqID_122804SeqID_120505 Cystatin domain SeqID_123974 ATP-synt_F6 SeqID_122448SeqID_120742 Mitochondrial ATP synthase SeqID_123680 coupling factoCyto_heme_lyase SeqID_122106 SeqID_124078 Cytochrome c/c1 heme lyaseDNA_pol_alpha_B SeqID_120691 DNA polymerase alpha subunit B ArmSeqID_122305 SeqID_122870 Armadillo/beta-catenin-like repeatSeqID_121423 SeqID_119427 SeqID_121148 SeqID_121201 SeqID_124047SeqID_121836 NTP_transferase SeqID_122569 SeqID_122914 Nucleotidyltransferase SeqID_120001 LSM SeqID_122052 SeqID_122121 LSM domainSeqID_122135 SeqID_122405 SeqID_122422 SeqID_122654 SeqID_122720SeqID_122860 SeqID_123047 SeqID_119181 SeqID_119424 SeqID_120049SeqID_120122 SeqID_121030 SeqID_123658 SeqID_123725 SeqID_123789SeqID_123944 SeqID_124195 NMT SeqID_120382 Myristoyl-CoA:protein N-myristoyltransferase TRAP-delta SeqID_122068 SeqID_120359Translocon-associated protein, SeqID_123786 delta subun DDOST_48kDSeqID_121541 SeqID_121553 Dolichyl- diphosphooligosaccharide-proteinRibosomal_S28e SeqID_122545 SeqID_120643 Ribosomal protein S28eSeqID_123620 UPF0120 SeqID_122142 SeqID_121137 Uncharacterised proteinfamily (UPF0120) Ala_racemase_N SeqID_121220 Alanine racemase,N-terminal domain MFAP1_C SeqID_119946 Micro-fibrillar-associatedprotein 1 C-termi Aminotran_3 SeqID_119827 Aminotransferase class-IIIACBP SeqID_121287 SeqID_119464 Acyl CoA binding protein PHD SeqID_123154SeqID_119379 PHD-finger SeqID_119449 SeqID_119766 SeqID_120788Aminotran_4 SeqID_122578 SeqID_119653 Aminotransferase class IVSeqID_119738 E3_binding SeqID_121975 SeqID_120333 e3 binding domainRibosomal_L37ae SeqID_122763 SeqID_119302 Ribosomal L37ae protein familySeqID_120838 SeqID_123781 SeqID_124249 zf-CCCH SeqID_121226 SeqID_122128Zinc finger C-x8-C-x5-C-x3-H type SeqID_122531 SeqID_122888 (and similSeqID_123330 SeqID_119331 SeqID_119820 SeqID_120375 SeqID_120658SeqID_121599 SeqID_121064 SeqID_123669 HMG-CoA_red SeqID_120545Hydroxymethylglutaryl-coenzyme A reductus CRAL_TRIO SeqID_120518CRAL/TRIO domain HAT SeqID_120676 HAT (Half-A-TPR) repeat Asn_synthaseSeqID_120554 SeqID_120715 Asparagine synthase PDCD9 SeqID_122558SeqID_123575 Mitochondrial 28S ribosomal protein S30 (PDC zf-CSLSeqID_122103 SeqID_120511 CSL zinc finger SeqID_123712 UBACTSeqID_122346 SeqID_121362 Repeat in ubiquitin-activating SeqID_121432SeqID_121452 (UBA) protein SeqID_119364 SeqID_124102 SeqID_121935Biotin_carb_C SeqID_120976 Biotin carboxylase C-terminal domain FrizzledSeqID_120827 Frizzled/Smoothened family membrane region FH2 SeqID_120125Formin Homology 2 Domain Coatomer_WDAD SeqID_119681 SeqID_120257Coatomer WD associated region zf-RanBP SeqID_123008 SeqID_121015Zn-finger in Ran binding protein and oth COQ7 SeqID_122535 SeqID_119203Ubiquinone biosynthesis protein SeqID_123627 COQ7 Ammonium_transpSeqID_120017 Ammonium Transporter Family PTR2 SeqID_119430 SeqID_120270POT family MPPN SeqID_119150 MPPN (rrm-like) domain Pyr_redoxSeqID_119535 SeqID_120156 Pyridine nucleotide-disulphide oxidoreductaCarn_acyltransf SeqID_119995 Choline/Carnitine o- acyltransferase TUDORSeqID_121121 Tudor domain Krr1 SeqID_120316 Krr1 family Asp SeqID_122475SeqID_123036 Eukaryotic aspartyl protease SeqID_123118 SeqID_123359SeqID_121455 SeqID_120476 SeqID_120494 SeqID_123971 BACK SeqID_120833SeqID_121637 BTB And C-terminal Kelch Uso1_p115_head SeqID_122989Uso1/p115 like vesicle tethering pro ATP-synt_ab_C SeqID_122371SeqID_122660 ATP synthase alpha/beta chain, C SeqID_123656 SeqID_123806termin SeqID_123805 Thiolase_C SeqID_122429 SeqID_122772 Thiolase,C-terminal domain SeqID_120310 SeqID_123864 5_nucleotid SeqID_120648 5′nucleotidase family FHA SeqID_120176 SeqID_120836 FHA domain PIDSeqID_123247 Phosphotyrosine interaction domain (PTB/PID) Citrate_syntSeqID_122637 SeqID_122705 Citrate synthase SeqID_120172 SeqID_124243Helicase_C SeqID_122054 SeqID_122318 Helicase conserved C-terminalSeqID_122331 SeqID_122336 domain SeqID_122624 SeqID_122677 SeqID_119233SeqID_119298 SeqID_119324 SeqID_119620 SeqID_119843 SeqID_119917SeqID_120153 SeqID_120168 SeqID_120256 SeqID_120323 SeqID_120342SeqID_120575 SeqID_120689 SeqID_120814 SeqID_120914 SeqID_121032SeqID_123546 SeqID_121674 SeqID_124119 SeqID_124137 SeqID_124148SeqID_124325 Neur_chan_LBD SeqID_121005 SeqID_121087Neurotransmitter-gated ion- channel lig Ion_trans_2 SeqID_121451SeqID_119613 Ion channel SeqID_119930 SeqID_119939 Myosin_tail_1SeqID_123386 Myosin tail TMS_TDE SeqID_122691 SeqID_119438 TMS membraneprotein/tumour SeqID_124263 differentially e P16-Arc SeqID_119921 ARP2/3complex 16 kDa subunit (p16-Arc) ATP-synt_ab_N SeqID_122110 SeqID_122131ATP synthase alpha/beta family, SeqID_122660 SeqID_119160 beta-baSeqID_123806 SeqID_123811 SeqID_124022 Endonuclease_NS SeqID_123055DNA/RNA non-specific endonuclease Thiolase_N SeqID_122429 SeqID_122772Thiolase, N-terminal domain SeqID_121350 SeqID_122890 SeqID_122953SeqID_123288 SeqID_123365 SeqID_121440 SeqID_121529 SeqID_120310SeqID_123593 SeqID_123864 Glycogen_syn SeqID_121477 SeqID_120492Glycogen synthase BTB SeqID_119616 SeqID_120373 BTB/POZ domainSeqID_121165 SeqID_121637 DUF236 SeqID_119224 SeqID_119247 Protein ofunknown function SeqID_120712 SeqID_120713 Ubiq_cyt_C_chap SeqID_122228SeqID_121449 Ubiquinol-cytochrome C SeqID_119888 SeqID_123755 chaperoneRenin_r SeqID_122129 SeqID_121442 Renin receptor-like proteinSeqID_123862 Cwf_Cwc_15 SeqID_122116 SeqID_120731 Cwf15/Cwc15 cell cyclecontrol SeqID_121658 SeqID_124014 protein A_deaminase SeqID_121003Adenosine/AMP deaminase DUF164 SeqID_121260 SeqID_123386 UncharacterizedACR, COG1579 ATP-synt_C SeqID_122557 SeqID_122592 ATP synthase subunit CSeqID_122805 SeqID_123522 SeqID_119902 SeqID_119958 SeqID_119959SeqID_120137 SeqID_124207 SeqID_124336 ATP-synt_D SeqID_122796SeqID_123297 ATP synthase subunit D SeqID_123366 SeqID_123854 LAG1SeqID_121272 SeqID_122595 Longevity-assurance protein SeqID_121367(LAG1) ATP-synt_E SeqID_123534 ATP synthase E chain ATP-synt_FSeqID_122088 SeqID_121348 ATP synthase (F/14-kDa) subunit SeqID_121745SeqID_121827 SeqID_121881 Filamin SeqID_120307 SeqID_120510Filamin/ABP280 repeat MazG SeqID_119952 MazG nucleotidepyrophosphohydrolase domain ATP-synt_G SeqID_122669 SeqID_119783Mitochondrial ATP synthase g SeqID_124110 subunit EF1_GNE SeqID_121982SeqID_122152 EF-1 guanine nucleotide exchange SeqID_122572 SeqID_119628domain SeqID_119899 SeqID_123606 FAD_binding_1 SeqID_122094 SeqID_122444FAD binding domain SeqID_123823 FAD_binding_2 SeqID_120807 FAD bindingdomain Dynein_light SeqID_122635 SeqID_123252 Dynein light chain type 1SeqID_119812 SeqID_123564 FAD_binding_3 SeqID_122411 SeqID_123933 FADbinding domain Astacin SeqID_122226 SeqID_122581 Astacin (Peptidasefamily M12A) SeqID_123481 SeqID_119601 SeqID_120015 SeqID_120571SeqID_120933 SeqID_121174 SeqID_121216 SeqID_124300 Tfb4 SeqID_122561SeqID_123565 Transcription factor Tfb4 Adaptin_N SeqID_120538SeqID_120950 Adaptin N terminal region EFG_IV SeqID_121397 SeqID_120540Elongation factor G, domain IV S-AdoMet_synt_C SeqID_123143 SeqID_119596S-adenosylmethionine synthetase, C-te Ribosomal_S10 SeqID_122673SeqID_119858 Ribosomal protein S10p/S20e SeqID_123703 SeqID_121948Ribosomal_S11 SeqID_122146 SeqID_123024 Ribosomal protein S11SeqID_123430 SeqID_120073 SeqID_120744 SeqID_123697 SeqID_124177Ribosomal_S12 SeqID_122060 SeqID_123235 Ribosomal protein S12SeqID_123361 SeqID_123428 SeqID_123434 SeqID_123629 SeqID_124279UCR_hinge SeqID_122302 SeqID_123516 Ubiquinol-cytochrome C reductaseSH3_1 SeqID_123228 SeqID_119323 SH3 domain SeqID_120958 Ribosomal_S13SeqID_122782 SeqID_122809 Ribosomal protein S13/S18 SeqID_123978SeqID_124200 SAP SeqID_120587 SAP domain SH3_2 SeqID_123228 SeqID_120958Variant SH3 domain Ribosomal_S14 SeqID_122064 SeqID_122107 Ribosomalprotein S14p/S29e SeqID_120012 SeqID_121048 SeqID_121107 SeqID_123630SeqID_123890 Brix SeqID_121565 SeqID_123250 Brix domain SeqID_119314SeqID_120464 SeqID_120490 SeqID_121589 SeqID_121109 Ribosomal_S15SeqID_122373 SeqID_123410 Ribosomal protein S15 SeqID_123433SeqID_121412 SeqID_123718 Ras SeqID_121288 SeqID_122204 Ras familySeqID_122360 SeqID_122392 SeqID_122397 SeqID_122560 SeqID_122575SeqID_122579 SeqID_122619 SeqID_122671 SeqID_122838 SeqID_122872SeqID_122903 SeqID_123052 SeqID_123189 SeqID_123241 SeqID_123414SeqID_123422 SeqID_119204 SeqID_119290 SeqID_119645 SeqID_119708SeqID_119885 SeqID_119891 SeqID_120379 SeqID_120664 SeqID_121613SeqID_120982 SeqID_123605 SeqID_121093 SeqID_123568 SeqID_123726SeqID_123740 SeqID_121691 SeqID_123960 SeqID_123988 SeqID_123991SeqID_121918 zf-TAZ SeqID_120357 TAZ zinc finger Ribosomal_S17SeqID_122602 SeqID_122629 Ribosomal protein S17 SeqID_119883SeqID_123918 SeqID_123925 Ribosomal_S18 SeqID_122652 Ribosomal proteinS18 ELK SeqID_120763 ELK domain Ribosomal_S19 SeqID_122056 SeqID_123333Ribosomal protein S19 SeqID_123450 SeqID_123685 C2 SeqID_119599SeqID_119684 C2 domain SeqID_120248 eIF-5_eIF-2B SeqID_123321SeqID_119265 Domain found in IF2B/IF5 SeqID_120603 SeqID_121769 VWDSeqID_122463 SeqID_123112 von Willebrand factor type D SeqID_119726SeqID_123592 domain GHMP_kinases SeqID_123199 GHMP kinases putative ATP-PAP_RNA-bind SeqID_120989 Poly(A) polymerase predicted RNA binding C4SeqID_119273 SeqID_119634 C-terminal tandem repeated domain in tP_proprotein SeqID_121230 SeqID_122663 Proprotein convertase P-domainSeqID_123014 SeqID_123393 ELO SeqID_122222 SeqID_122315 GNS1/SUR4 familySeqID_119651 SeqID_121493 SeqID_120991 SeqID_124009 SeqID_124028 p450SeqID_120075 SeqID_120579 Cytochrome P450 Complex1_30kDa SeqID_121297SeqID_119579 Respiratory-chain NADH dehydrogenase, wnt SeqID_122540 wntfamily OATP SeqID_119717 Organic Anion Transporter Polypeptide (OATP)RIIa SeqID_122847 SeqID_121204 Regulatory subunit of type II PKAR-subunit Pyrophosphatase SeqID_122209 SeqID_123850 Inorganicpyrophosphatase A1_Propeptide SeqID_122475 SeqID_123036 A1 PropeptideSeqID_123359 SeqID_121455 SeqID_120476 SeqID_120494 SeqID_123971 PRP1_NSeqID_121212 PRP1 splicing factor, N-terminal RhoGAP SeqID_120222SeqID_120311 RhoGAP domain G-alpha SeqID_120144 SeqID_120531 G-proteinalpha subunit SeqID_120703 SeqID_121138 Guanylate_kin SeqID_120766Guanylate kinase HSF_DNA-bind SeqID_123050 HSF-type DNA-bindingDnaJ_CXXCXGXG SeqID_121279 SeqID_122722 DnaJ central domain (4 repeats)SeqID_121405 SeqID_121507 SeqID_120879 SeqID_121218 SeqID_123882Collagen SeqID_121964 SeqID_121996 Collagen triple helix repeat (20SeqID_122013 SeqID_122019 copies) SeqID_122027 SeqID_122216 SeqID_122291SeqID_122304 SeqID_122313 SeqID_122472 SeqID_122536 SeqID_122650SeqID_121377 SeqID_122871 SeqID_122911 SeqID_122954 SeqID_123029SeqID_123087 SeqID_123176 SeqID_123223 SeqID_123423 SeqID_119253SeqID_119404 SeqID_119418 SeqID_119540 SeqID_119564 SeqID_119634SeqID_119798 SeqID_119948 SeqID_119971 SeqID_120106 SeqID_120175SeqID_120214 SeqID_120361 SeqID_120474 SeqID_120626 SeqID_120674SeqID_120705 SeqID_123663 SeqID_121026 SeqID_121073 SeqID_121150SeqID_123664 SeqID_123677 SeqID_123676 SeqID_123769 SeqID_123783SeqID_123827 SeqID_121953 SeqID_121958 Tubulin_C SeqID_122269SeqID_122332 Tubulin/FtsZ family, C-terminal SeqID_122440 SeqID_119159domain SeqID_119381 SeqID_119720 SeqID_120775 SeqID_123545 SeqID_123884SeqID_123998 SeqID_121849 SeqID_121940 JmjC SeqID_122284 SeqID_120520jmjC domain Zona_pellucida SeqID_119201 SeqID_120047 Zona pellucida-likedomain CH SeqID_122025 SeqID_122264 Calponin homology (CH) domainSeqID_122337 SeqID_122473 SeqID_122788 SeqID_123499 SeqID_119750SeqID_120539 SeqID_123855 SeqID_124085 SeqID_124206 E-MAP-115SeqID_120063 E-MAP-115 family CPSase_L_D2 SeqID_119260 SeqID_119619Carbamoyl-phosphate synthase L SeqID_120976 chain, A HMG_boxSeqID_122876 SeqID_122918 HMG (high mobility group) box SeqID_121497SeqID_120582 SeqID_120832 SeqID_120943 Ribosomal_S25 SeqID_122611SeqID_122695 S25 ribosomal protein SeqID_123417 SeqID_123597SeqID_124222 SeqID_124340 Ribosomal_S27 SeqID_122053 SeqID_122726Ribosomal protein S27a SeqID_121490 SeqID_123637 SeqID_123659Galactosyl_T SeqID_120408 Galactosyltransferase CS SeqID_122778SeqID_122828 CS domain SeqID_123335 SeqID_121480 SeqID_120610Voltage_CLC SeqID_120735 SeqID_121213 Voltage gated chloride channelLactamase_B SeqID_120930 Metallo-beta-lactamase superfamily eRF1_1SeqID_119244 eRF1 domain 1 eRF1_2 SeqID_123076 SeqID_119244 eRF1 domain2 Flavoprotein SeqID_122326 SeqID_119773 Flavoprotein eRF1_3SeqID_123076 SeqID_119244 eRF1 domain 3 EMP24_GP25L SeqID_122517SeqID_119571 emp24/gp25L/p24 family SeqID_123733 NOT2_3_5 SeqID_122445SeqID_121346 NOT2/NOT3/NOT5 family SeqID_123876 Hydantoinase_ASeqID_121246 SeqID_121266 Hydantoinase/oxoprolinase Plus-3 SeqID_122270Plus-3 domain IBB SeqID_122870 SeqID_121423 Importin beta binding domainSeqID_119952 SeqID_121148 Ald_Xan_dh_C2 SeqID_120485 SeqID_121912Aldehyde oxidase and xanthine dehydroge Complex1_49kDa SeqID_120479Respiratory-chain NADH dehydrogenase, SSrecog SeqID_120076Structure-specific recognition protein Aldo_ket_red SeqID_123277Aldo/keto reductase family TFIIS_C SeqID_122597 SeqID_123714Transcription factor S-II (TFIIS) Thioredoxin SeqID_121221 SeqID_122115Thioredoxin SeqID_122712 SeqID_123314 SeqID_120315 SeqID_120528SeqID_121134 SeqID_123686 SeqID_123796 SeqID_121887 vATP-synt_ESeqID_122754 SeqID_121463 ATP synthase (E/31 kDa) subunit SeqID_119218SeqID_124020 SeqID_121724 Metallothio SeqID_119714 MetallothineinMo-co_dimer SeqID_122426 SeqID_119301 Mo-co oxidoreductase dimensationSeqID_123908 doma ORC2 SeqID_121926 Origin recognition complex subunit 2Ribosomal_S30 SeqID_122415 SeqID_123507 Ribosomal protein S30SeqID_120390 SeqID_123707 SCP SeqID_121229 SeqID_121227 SCP-likeextracellular protein SeqID_121228 SeqID_122770 SeqID_122767SeqID_122864 SeqID_123501 SeqID_121425 SeqID_119399 SeqID_119556SeqID_120347 SeqID_121641 GPP34 SeqID_120614 Golgi phosphoprotein 3(GPP34) DM SeqID_121139 DM DNA binding domain GTP_CDC SeqID_119791 Celldivision protein AhpC-TSA SeqID_122512 SeqID_122760 AhpC/TSA familySeqID_123258 SeqID_123300 SeqID_119348 SeqID_123739 SeqID_123758 CAP_GLYSeqID_123227 CAP-Gly domain XRCC1_N SeqID_120679 XRCC1 N terminal domainDUF26 SeqID_122879 Domain of unknown function DUF26 TRAP_betaSeqID_122465 SeqID_119980 Translocon-associated protein SeqID_123848beta (TRAPB) Cation_ATPase_N SeqID_122184 SeqID_119574 Cationtransporter/ATPase, N- SeqID_119844 SeqID_120162 terminus SeqID_124141SeqID_124234 XPG_I SeqID_120903 XPG I-region TFIID-31kDa SeqID_122432SeqID_123899 Transcription initiation factor IID, 31 kD ARPC4SeqID_123032 SeqID_121098 ARP2/3 complex 20 kDa subunit (ARPC4) NopSeqID_122208 SeqID_119511 Putative snoRNA binding domain SeqID_120274SeqID_123696 DX SeqID_119623 DX module Choline_kinase SeqID_122682SeqID_124140 Choline/ethanolamine kinase SeqID_121776 Seryl_tRNA_NSeqID_122169 SeqID_124027 Seryl-tRNA synthetase N-terminal domain UFD1SeqID_122278 SeqID_122991 Ubiquitin fusion degradation SeqID_120363SeqID_123652 protein UFD1 AICARFT_IMPCHas SeqID_121071 AICARFT/IMPCHasebienzyme Adap_comp_sub SeqID_122747 SeqID_119495 Adaptor complexesmedium SeqID_120295 subunit family V_ATPase_I SeqID_121161 V-type ATPase116 kDa subunit family eIF-6 SeqID_122640 SeqID_120843 eIF-6 familySeqID_123945 TAF SeqID_121234 SeqID_122483 TATA box binding proteinSeqID_122744 SeqID_122771 associated fac SeqID_119330 SeqID_121479SeqID_121482 SeqID_124258 SeqID_124262 HSP70 SeqID_121224 SeqID_121246Hsp70 protein SeqID_121258 SeqID_121266 SeqID_122340 SeqID_122574SeqID_121338 SeqID_123128 SeqID_123276 SeqID_121398 SeqID_121394SeqID_121403 SeqID_121456 SeqID_119310 SeqID_119878 SeqID_121465SeqID_121528 SeqID_120193 SeqID_120276 SeqID_120425 SeqID_120426SeqID_120477 SeqID_120618 SeqID_120675 SeqID_123580 SeqID_121647SeqID_123958 SeqID_121785 Rho_GDI SeqID_123503 SeqID_119733 RHO proteinGDP dissociation SeqID_121942 inhibitor E2F_TDP SeqID_122353SeqID_120460 E2F/DP family winged-helix DNA- SeqID_120823 SeqID_124183binding domai VPS28 SeqID_122699 SeqID_119677 VPS28 protein SeqID_124233SNARE SeqID_122431 SeqID_123902 SNARE domain CBM_14 SeqID_120683 Chitinbinding Peritrophin-A domain Fic SeqID_119770 Fic protein familyPeptidase_M16_C SeqID_123083 SeqID_119755 Peptidase M16 inactive domainefhand SeqID_122031 SeqID_122171 EF hand SeqID_122356 SeqID_122443SeqID_122544 SeqID_122641 SeqID_122769 SeqID_122783 SeqID_122802SeqID_122976 SeqID_123119 SeqID_119228 SeqID_119553 SeqID_119614 zf-CCHCSeqID_122365 SeqID_122548 Zinc knuckle SeqID_123105 SeqID_121406SeqID_121418 SeqID_119748 SeqID_119884 SeqID_119889 SeqID_121017SeqID_121081 SeqID_121192 SeqID_123993 Rieske SeqID_119282 Rieske[2Fe—2S] domain EF_TS SeqID_122638 SeqID_121030 Elongation factor TSGTP_EFTU SeqID_122392 SeqID_122600 Elongation factor Tu GTP bindingSeqID_122657 SeqID_121329 domain SeqID_122792 SeqID_122872 SeqID_123241SeqID_121430 SeqID_119311 SeqID_119332 SeqID_119352 SeqID_119600SeqID_120413 SeqID_120540 SeqID_121558 SeqID_121576 SeqID_121613SeqID_123754 SeqID_121709 SeqID_123950 SeqID_124180 SeqID_121717SeqID_121740 SeqID_121805 SeqID_121824 TPD52 SeqID_120171 Tumour proteinD52 family UCR_TM SeqID_119282 SeqID_121586 Ubiquinol cytochromereductase SeqID_121732 SeqID_121833 transmembrane Coq4 SeqID_120870Coenzyme Q (ubiquinone) biosynthesis protein zf-HIT SeqID_123532 HITzinc finger CUB SeqID_123097 SeqID_119304 CUB domain DUF32 SeqID_119459SeqID_119801 Domain of unknown function SeqID_121149 DUF32 adh_shortSeqID_121285 SeqID_122087 short chain dehydrogenase SeqID_122187SeqID_122421 SeqID_122523 SeqID_122530 SeqID_122596 SeqID_121476SeqID_120081 SeqID_120916 SeqID_123628 SeqID_123706 SeqID_123772SeqID_123943 RGS SeqID_121152 Regulator of G protein signaling PMMSeqID_122570 SeqID_124322 Eukaryotic phosphomannomutase ER SeqID_121000Enhancer of rudimentary Patched SeqID_120169 SeqID_120785 Patched familySeqID_120998 RRM_1 SeqID_121236 SeqID_121239 RNA recognition motif.(a.k.a. SeqID_122099 SeqID_122100 RRM, RBD, or SeqID_122122 SeqID_122126SeqID_122158 SeqID_122202 SeqID_122250 SeqID_122361 SeqID_122469SeqID_122526 SeqID_122528 SeqID_122549 SeqID_122676 SeqID_122700SeqID_122831 SeqID_122888 SeqID_122938 SeqID_122962 SeqID_123004SeqID_123008 SeqID_123071 SeqID_123101 SeqID_123331 SeqID_123370SeqID_123384 SeqID_123396 SeqID_123424 SeqID_119150 SeqID_119180SeqID_119191 SeqID_119215 SeqID_119241 SeqID_124159 SeqID_119361SeqID_119362 SeqID_119398 SeqID_119489 SeqID_119527 SeqID_119551SeqID_124128 I-set SeqID_119655 SeqID_119820 Immunoglobulin I-set domainSeqID_119864 SeqID_119970 SeqID_121498 SeqID_121513 SeqID_120027SeqID_123815 SeqID_120174 SeqID_120251 SeqID_120319 SeqID_120336SeqID_120348 SeqID_120467 SeqID_120468 SeqID_120514 SeqID_120570SeqID_120741 SeqID_120765 SeqID_123671 SeqID_121559 SeqID_121578SeqID_121599 SeqID_121600 SeqID_121609 SeqID_120880 SeqID_120956SeqID_121015 SeqID_121055 SeqID_121119 SeqID_121160 SeqID_123601SeqID_123673 SeqID_123785 SeqID_123791 SeqID_123809 SeqID_123819SeqID_123820 SeqID_123839 SeqID_121697 SeqID_124070 SeqID_124091SeqID_124152 SeqID_124176 SeqID_124194 SeqID_124259 SeqID_121731SeqID_121748 SeqID_121845 SeqID_121882 SeqID_121928 SeqID_121933SeqID_123529 SeqID_119158 SeqID_119196 SeqID_119491 SeqID_119569SeqID_119713 SeqID_119816 SeqID_120343 TBC SeqID_119295 SeqID_119731 TBCdomain SeqID_121193 Calpain_III SeqID_120374 Calpain large subunit,domain III CBFD_NFYB_HMF SeqID_121234 SeqID_122039 Histone-liketranscription factor SeqID_122046 SeqID_122119 (CBF/ SeqID_122455SeqID_122483 SeqID_122582 SeqID_122688 SeqID_122744 SeqID_122766SeqID_122771 SeqID_121314 SeqID_123003 SeqID_123317 SeqID_123463SeqID_123473 SeqID_121448 SeqID_121446 SeqID_119330 SeqID_119493SeqID_119512 SeqID_119521 SeqID_119609 SeqID_119776 SeqID_121479SeqID_121482 SeqID_120790 SeqID_121601 SeqID_123744 SeqID_121643SeqID_121663 SeqID_121675 SeqID_121684 SeqID_121688 SeqID_121706SeqID_124105 SeqID_124164 SeqID_124258 SeqID_124262 SeqID_124276SeqID_121771 SeqID_121809 SeqID_121823 SeqID_121905 UDPG_MGDP_dh_CSeqID_123494 UDP-glucose/GDP-mannose dehydrogenase zf-UBP SeqID_122511SeqID_122662 Zn-finger in ubiquitin-hydrolases SeqID_120244 and otherUPF0184 SeqID_122379 SeqID_123860 Uncharacterised protein family(UPF0184) FF SeqID_121478 SeqID_120846 FF domain SeqID_121180 RhodaneseSeqID_123181 SeqID_123269 Rhodanese-like domain TBP SeqID_120108SeqID_121146 Transcription factor TFIID (or SeqID_121734 TATA-bindingCytochrom_C1 SeqID_119212 Cytochrome C1 family PI-PLC-Y SeqID_120248Phosphatidylinositol-specific phospholipase Glycolytic SeqID_121242SeqID_121268 Fructose-bisphosphate aldolase SeqID_122585 SeqID_121503class-I SeqID_120227 SeqID_120758 SeqID_123730 SeqID_124335 SeqID_121856COX15-CtaA SeqID_119736 Cytochrome oxidase assembly Actin SeqID_121231SeqID_121257 Actin SeqID_122239 SeqID_122271 SeqID_122678 SeqID_122922SeqID_122921 SeqID_122940 SeqID_122974 SeqID_122977 SeqID_122985SeqID_123092 SeqID_123347 SeqID_123374 SeqID_123381 SeqID_123376SeqID_123378 SeqID_123388 SeqID_123397 SeqID_123394 SeqID_123402SeqID_119184 SeqID_121411 SeqID_119898 SeqID_121581 SeqID_121592SeqID_123541 SeqID_123542 SeqID_123817 SeqID_123843 SeqID_121690SeqID_124081 SeqID_124084 SeqID_124093 SeqID_124188 SeqID_124303SeqID_121834 SeqID_121888 SeqID_121911 ATP_synt_H SeqID_122810SeqID_120261 ATP synthase subunit H SeqID_124197 SET SeqID_122866SeqID_123437 SET domain SeqID_119452 SeqID_119703 SeqID_121141Ribosomal_L5_C SeqID_121970 SeqID_122399 ribosomal L5P family C-terminusSeqID_122665 SeqID_123116 SeqID_123183 SeqID_120376 SeqID_123547SeqID_124178 ADK_lid SeqID_121282 SeqID_122317 Adenylate kinase, activesite lid GrpE SeqID_122071 SeqID_123157 GrpE SeqID_123878 SeqID_121526SeqID_120867 SeqID_123661 SeqID_121743 XRN_N SeqID_122345 SeqID_121207XRN 5′-3′ exonuclease N-terminus SeqID_124106 Ribosomal_L1 SeqID_122679SeqID_122715 Ribosomal protein L1p/L10e family SeqID_124062 RhoGEFSeqID_122254 SeqID_120745 RhoGEF domain Y_phosphatase SeqID_123060SeqID_119220 Protein-tyrosine phosphatase SeqID_119686 SeqID_119866SeqID_119975 SeqID_120184 SeqID_120234 SeqID_120792 SeqID_121037SeqID_121036 SeqID_121105 Ribosomal_L2 SeqID_122387 SeqID_122576Ribosomal Proteins L2, RNA SeqID_119709 SeqID_121219 binding domSeqID_123662 SeqID_124015 SeqID_121891 7tm_1 SeqID_119786 7transmembrane receptor (rhodopsin family) Ribosomal_L3 SeqID_122659SeqID_122727 Ribosomal protein L3 SeqID_123246 SeqID_121145 SeqID_123585Sdh_cyt SeqID_122442 SeqID_123881 Succinate dehydrogenase cytochrome bsubunit DNA_topoisoIV SeqID_120500 DNA gyrase/topoisomerase IV, subunitA 7tm_2 SeqID_120113 7 transmembrane receptor (Secretin family)Ribosomal_L4 SeqID_122113 SeqID_122385 Ribosomal protein L4/L1 familySeqID_121324 SeqID_122845 SeqID_121127 SeqID_123536 SeqID_123727 7tm_3SeqID_120501 SeqID_120729 7 transmembrane receptor SeqID_120999(metabotropic gluta Ribosomal_L5 SeqID_121970 SeqID_122399 Ribosomalprotein L5 SeqID_122665 SeqID_122908 SeqID_123116 SeqID_123183SeqID_123517 SeqID_123547 SeqID_124178 PAPS_reduct SeqID_120874Phosphoadenosine phosphosulfate reductase Ribosomal_L6 SeqID_122583SeqID_123234 Ribosomal protein L6 SeqID_121602 SeqID_120932 SeqID_123631ADAM_spacer1 SeqID_119479 ADAM-TS Spacer 1 HSP90 SeqID_121981SeqID_122028 HSP90 protein SeqID_122738 SeqID_121524 SeqID_120445SeqID_120787 SeqID_121577 SeqID_123813 SeqID_121774 SeqID_121829SeqID_121895 Abhydrolase_1 SeqID_119405 SeqID_121610 alpha/betahydrolase fold Peptidase_M1 SeqID_120743 SeqID_120851 Peptidase familyM1 Herpes_LP SeqID_120428 Herpesvirus leader protein Pescadillo_NSeqID_122934 Pescadillo N-terminus Abhydrolase_3 SeqID_119940SeqID_120282 alpha/beta hydrolase fold SeqID_120891 CPSase_L_chainSeqID_119260 SeqID_119619 Carbamoyl-phosphate synthase L chain,PMI_typeI SeqID_122372 SeqID_123985 Phosphomannose isomerase type IGlyco_hydro_18 SeqID_122290 SeqID_119433 Glycosyl hydrolases family 18SeqID_123731 SeqID_124053 Profilin SeqID_122484 SeqID_120556 ProfilinSeqID_123780 SeqID_121762 SeqID_121862 RIO1 SeqID_120808 RIO1 familyTCTP SeqID_122571 SeqID_123113 Translationally controlled tumourSeqID_121485 SeqID_120084 protein SeqID_120085 SeqID_123959 NTF2SeqID_122143 SeqID_123505 Nuclear transport factor 2 (NTF2) SeqID_123530SeqID_119273 domain SeqID_120761 AP_endonuc_2 SeqID_120685 Xyloseisomerase-like TIM barrel GATase_2 SeqID_122634 SeqID_119643 Glutamineamidotransferases SeqID_123847 class-II RRS1 SeqID_122559 SeqID_122857Ribosome biogenesis regulatory SeqID_123569 protein (RRS1 Gln-synt_CSeqID_122482 SeqID_120289 Glutamine synthetase, catalytic domainPribosyltran SeqID_122656 SeqID_122880 Phosphoribosyl transferaseSeqID_120272 SeqID_124138 domain DUF367 SeqID_120808 Domain of unknownfunction (DUF367) PWP2 SeqID_121293 Periodic tryptophan protein 2 WDrepeat asso RNA_pol_Rpa2_4 SeqID_119390 RNA polymerase I, Rpa2 specificdomain HesB SeqID_123420 HesB-like domain SPRY SeqID_122960 SeqID_119817SPRY domain COX4 SeqID_122622 SeqID_122739 Cytochrome c oxidase subunitIV SeqID_121444 SeqID_121470 SeqID_120419 SeqID_121605 SeqID_123898SeqID_121775 Gp-FAR-1 SeqID_122790 SeqID_123533 Nematode fatty acidretinoid SeqID_120157 SeqID_123934 binding protein SeqID_124215Gln-synt_N SeqID_122482 SeqID_123792 Glutamine synthetase, beta-Graspdomain Transketolase_C SeqID_119346 Transketolase, C-terminal domain CtrSeqID_119988 Ctr copper transporter family RCC1 SeqID_122092SeqID_120326 Regulator of chromosome SeqID_123910 condensation (RCC1)Pkinase_Tyr SeqID_121233 SeqID_121271 Protein tyrosine kinaseSeqID_122086 SeqID_122117 SeqID_122221 SeqID_122233 SeqID_122285SeqID_122384 SeqID_122470 SeqID_122618 SeqID_122687 SeqID_122758SeqID_123026 SeqID_123421 SeqID_123491 SeqID_119148 SeqID_121409SeqID_121416 SeqID_119206 SeqID_119219 SeqID_119238 SeqID_119245SeqID_119296 SeqID_119338 SeqID_119410 SeqID_119412 SeqID_119422SeqID_119436 SeqID_119453 SeqID_119472 SeqID_119483 SeqID_119520SeqID_119525 SeqID_119541 SeqID_119728 SeqID_119764 SeqID_119789SeqID_119828 SeqID_120006 SeqID_120051 SeqID_120126 SeqID_120166SeqID_120298 SeqID_120406 SeqID_120418 SeqID_120428 SeqID_120452SeqID_120453 SeqID_120550 SeqID_120636 SeqID_120663 SeqID_120672SeqID_120772 SeqID_121622 SeqID_120934 SeqID_120996 SeqID_121189SeqID_123672 SeqID_123838 SeqID_123977 SeqID_123989 SeqID_124109SeqID_124294 SeqID_121714 SeqID_121742 SeqID_121803 SeqID_121897SeqID_121907 OSCP SeqID_122390 SeqID_123173 ATP synthase delta (OSCP)SeqID_123852 SeqID_121694 subunit Ham1p_like SeqID_120320 Ham1 familyTransketolase_N SeqID_122651 SeqID_119261 Transketolase, thiamineSeqID_123846 diphosphate b HD SeqID_119514 SeqID_119787 HD domainMreB_Mbl SeqID_121224 SeqID_121246 MreB/Mbl protein SeqID_121258SeqID_121266 SeqID_122340 SeqID_119310 SeqID_120675 SeqID_123580Fzo_mitofusin SeqID_119682 fzo-like conserved region GCFC SeqID_120111GC-rich sequence DNA-binding factor-like pro DER1 SeqID_123172SeqID_120680 Der1-like family Phosphorylase SeqID_120862 Carbohydratephosphorylase SH2 SeqID_122470 SeqID_123228 SH2 domain SeqID_119500SeqID_120234 SeqID_120452 SeqID_120453 SeqID_120921 SeqID_120958 CXCSeqID_121122 Tesmin/TSO1-like CXC domain Aldedh SeqID_120628 Aldehydedehydrogenase family CK_II_beta SeqID_120267 Casein kinase II regulatorysubunit ERM SeqID_121435 SeqID_121811 Ezrin/radixin/moesin family3HCDH_N SeqID_122892 SeqID_119503 3-hydroxyacyl-CoA SeqID_120447dehydrogenase, NAD binding Troponin SeqID_121972 SeqID_122915 TroponinSeqID_123477 SeqID_120512 SeqID_123537 zf-U1 SeqID_122257 SeqID_122817U1 zinc finger SeqID_119881 Dynamin_M SeqID_121459 SeqID_121020 Dynamincentral region LBP_BPI_CETP_C SeqID_121024 LBP/BPI/CETP family, C-terminal do UBA SeqID_122024 SeqID_122255 UBA/TS-N domain SeqID_122435SeqID_122689 SeqID_120244 SeqID_123742 SeqID_123895 SeqID_121722SeqID_121739 Dynamin_N SeqID_122325 SeqID_119276 Dynamin familySeqID_119682 SeqID_121020 SeqID_123830 FG-GAP SeqID_120791 FG-GAP repeatSupt5 SeqID_119904 Supt5 repeat CHORD SeqID_122778 SeqID_121480 CHORDSeqID_120611 SeqID_124127 Ribosomal_S6e SeqID_122091 SeqID_122108Ribosomal protein S6e SeqID_123156 SeqID_123278 SeqID_123328SeqID_123341 SeqID_123427 SeqID_119761 SeqID_123571 SeqID_124292Gtr1_RagA SeqID_123193 SeqID_121217 Gtr1/RagA G protein conserved regionCAF1 SeqID_122750 SeqID_123054 CAF1 family ribonuclease SeqID_123367SeqID_124146 SeqID_121735 RNA_pol_Rpb6 SeqID_122713 SeqID_120963 RNApolymerase Rpb6 SeqID_124170 Hist_deacetyl SeqID_122658 SeqID_120496Histone deacetylase domain SeqID_124304 RNA_pol_Rpb8 SeqID_122008SeqID_122789 RNA polymerase Rpb8 SeqID_119393 SeqID_123747Ribosomal_L10e SeqID_122247 SeqID_122929 Ribosomal L10 SeqID_123115SeqID_123302 SeqID_123354 SeqID_121429 SeqID_119804 SeqID_123702SeqID_121800 SeqID_121851 DUF1127 SeqID_121449 Domain of unknownfunction (DUF1127) FARP SeqID_121261 SeqID_122785 FMRFamide relatedpeptide family ubiquitin SeqID_121249 SeqID_121259 Ubiquitin familySeqID_121959 SeqID_121966 SeqID_121974 SeqID_121987 SeqID_122018SeqID_122017 SeqID_122021 SeqID_122020 SeqID_122023 SeqID_122026SeqID_122033 SeqID_122042 SeqID_122053 SeqID_122150 SeqID_122234SeqID_122256 SeqID_122415 SeqID_122461 SeqID_122550 SeqID_122647SeqID_122726 SeqID_122768 SeqID_121317 SeqID_121325 SeqID_122784SeqID_122823 SeqID_123207 SeqID_123316 SeqID_123462 SeqID_123488SeqID_123500 SeqID_119205 SeqID_121408 SeqID_121458 SeqID_119321SeqID_119375 SeqID_119457 SeqID_119597 SeqID_119702 SeqID_119857mRNA_cap_enzyme SeqID_123044 SeqID_120040 mRNA capping enzyme, catalyticdomain Ribosomal_60s SeqID_122215 SeqID_122490 60s Acidic ribosomalprotein SeqID_123784 SeqID_124059 SHMT SeqID_123444 Serinehydroxymethyltransferase TSP_1 SeqID_121277 SeqID_119765 Thrombospondintype 1 domain SeqID_121174 Bin3 SeqID_122035 SeqID_120646Bicoid-interacting protein 3 (Bin3) SeqID_123803 APS_kinase SeqID_119710Adenylylsulphate kinase GSH_synthase SeqID_122183 SeqID_119391Eukaryotic glutathione synthase SeqID_120121 SeqID_120409 SeqID_120622SeqID_120719 SeqID_120886 SeqID_120895 SeqID_121074 SeqID_123719 SFT2SeqID_123142 SeqID_121421 SFT2-like protein SeqID_120229 HomeoboxSeqID_119358 SeqID_119397 Homeobox domain SeqID_119915 SeqID_120364SeqID_120535 SeqID_121140 Pox_A_type_inc SeqID_121260 SeqID_122546 ViralA-type inclusion protein SeqID_123618 repeat iPGM_N SeqID_122467SeqID_123844 BPG-independent PGAM N- SeqID_120794 terminus (iPGM_N)RNA_pol_L SeqID_122755 SeqID_120230 RNA polymerase Rpb3/Rpb11SeqID_123595 dimerisation doma V-set SeqID_123529 SeqID_119196Immunoglobulin V-set domain SeqID_119491 SeqID_119713 SeqID_119816SeqID_120343 CTP_synth_N SeqID_122680 SeqID_120279 CTP synthaseN-terminus SeqID_120818 SeqID_123812 AAA SeqID_122138 SeqID_122219ATPase family associated with SeqID_122276 SeqID_122358 various cellulSeqID_122493 SeqID_122675 SeqID_121315 SeqID_121378 SeqID_123161SeqID_119176 SeqID_121460 SeqID_119267 SeqID_119300 SeqID_119612SeqID_119652 SeqID_119734 SeqID_119875 SeqID_120119 SeqID_120736SeqID_121618 SeqID_120857 SeqID_120931 SeqID_120938 SeqID_123801SeqID_123871 SeqID_124113 SeqID_124117 SeqID_121875 SeqID_121923PP-binding SeqID_123286 SeqID_123525 Phosphopantetheine attachmentSeqID_120461 SeqID_124213 site SeqID_124214 CDC37 SeqID_120246 Cdc37family FtsJ SeqID_120165 FtsJ-like methyltransferase Peroxin-13_NSeqID_119323 Peroxin 13, N-terminal Ribosomal_S7e SeqID_122524SeqID_121334 Ribosomal protein S7e SeqID_123124 SeqID_121922 Sugar_trSeqID_119607 SeqID_120612 Sugar (and other) transporter UCH SeqID_122246SeqID_122435 Ubiquitin carboxyl-terminal SeqID_122511 SeqID_121433hydrolase SeqID_119366 SeqID_119785 SeqID_119799 SeqID_120244SeqID_120577 SeqID_121090 SeqID_123828 SeqID_123895 SeqID_121784HATPase_c SeqID_121965 SeqID_121968 Histidine kinase-, DNA gyrase B-,SeqID_121976 SeqID_122028 and HSP90 SeqID_122252 SeqID_123282SeqID_124089 SeqID_120445 SeqID_123704 Activin_recp SeqID_123267 Activintypes I and II receptor domain DUF602 SeqID_119868 Protein of unknownfunction, DUF602 DUF1136 SeqID_119196 Repeat of unknown function(DUF1136) TAFII28 SeqID_123141 hTAFII28-like protein conserved regionPkinase SeqID_121233 SeqID_121271 Protein kinase domain SeqID_122086SeqID_122117 SeqID_122221 SeqID_122233 SeqID_122285 SeqID_122384SeqID_122470 SeqID_122502 SeqID_122618 SeqID_122687 SeqID_122758SeqID_122956 SeqID_123026 SeqID_123491 SeqID_119148 SeqID_121409SeqID_121416 SeqID_121441 SeqID_119219 SeqID_119237 SeqID_119238SeqID_119245 SeqID_119296 SeqID_119338 SeqID_119365 SeqID_119410SeqID_119412 SeqID_119436 SeqID_119453 SeqID_119472 SeqID_119483SeqID_119520 SeqID_119525 SeqID_119576 SeqID_119629 SeqID_119728SeqID_119764 SeqID_119789 SeqID_119828 SeqID_119968 SeqID_120006SeqID_120126 SeqID_120166 SeqID_120208 SeqID_120298 SeqID_120406SeqID_120428 SeqID_120452 SeqID_120453 SeqID_120550 SeqID_120636SeqID_120663 SeqID_120672 SeqID_120701 SeqID_120771 SeqID_120772SeqID_121561 SeqID_121583 SeqID_121622 SeqID_120796 SeqID_120934SeqID_120996 SeqID_121102 SeqID_121189 SeqID_123672 SeqID_123947SeqID_123977 SeqID_123989 SeqID_124109 SeqID_124130 SeqID_124294SeqID_121714 SeqID_121742 SeqID_121803 SeqID_121897 SeqID_121903SeqID_121907 SeqID_121920 KH_1 SeqID_122074 SeqID_122460 KH domainSeqID_121313 SeqID_123392 SeqID_123476 SeqID_119195 SeqID_120972SeqID_124132 SeqID_121787 FA_hydroxylase SeqID_122136 SeqID_123602 Fattyacid hydroxylase Clc-like SeqID_122446 SeqID_120339 Clc-likeSeqID_120721 SeqID_123875 KH_2 SeqID_122074 SeqID_122370 KH domainSeqID_123179 SeqID_121535 SeqID_121549 SeqID_123699 SeqID_124132SeqID_124172 SeqID_124328 SeqID_121787 Galactosyl_T_2 SeqID_122573SeqID_122681 Galactosyltransferase SeqID_119322 SeqID_124349 PiwiSeqID_121243 SeqID_121305 Piwi domain SeqID_123411 SeqID_119347SeqID_119676 SeqID_120129 SeqID_121773 SeqID_121792 RLI SeqID_122076SeqID_120449 Possible metal-binding domain in SeqID_120808 SeqID_124131RNase L inh HORMA SeqID_121286 SeqID_119567 HORMA domain SeqID_121728RNA_pol_Rpb1_3 SeqID_120039 SeqID_120437 RNA polymerase Rpb1, domain 3SeqID_120506 Ldh_2 SeqID_122223 SeqID_121410 Malate/L-lactatedehydrogenase SeqID_121556 SeqID_121574 SeqID_124002 NeuralizedSeqID_122352 SeqID_120365 Neuralized SeqID_124000 SeqID_124184RNA_pol_Rpb1_4 SeqID_120039 SeqID_120437 RNA polymerase Rpb1, domain 4SeqID_120506 SeqID_120541 RNA_pol_Rpb1_5 SeqID_120506 RNA polymeraseRpb1, domain 5 RNA_pol_Rpb1_6 SeqID_120506 RNA polymerase Rpb1, domain 6Clat_adaptor_s SeqID_122102 SeqID_122224 Clathrin adaptor complex smallSeqID_122801 SeqID_122988 chain SeqID_120385 SeqID_120859 SeqID_121131SeqID_123790 SeqID_124166 IF4E SeqID_122412 SeqID_120589 Eukaryoticinitiation factor 4E Kinesin SeqID_121296 SeqID_122362 Kinesin motordomain SeqID_119669 SeqID_124181 G10 SeqID_122515 G10 proteinGround-like SeqID_120751 Ground-like domain P34-Arc SeqID_123215SeqID_121855 Arp2/3 complex, 34 kD subunit p34-Arc Ribosomal_S8eSeqID_122003 SeqID_122010 Ribosomal protein S8e SeqID_122130SeqID_122508 SeqID_121333 SeqID_123027 SeqID_121508 SeqID_121531SeqID_121545 SeqID_123644 SeqID_121667 Ribosomal_S3_C SeqID_122370SeqID_123179 Ribosomal protein S3, C-terminal SeqID_119376 SeqID_121535domai SeqID_121126 SeqID_123699 ResIII SeqID_122318 SeqID_119170 TypeIII restriction enzyme, res SeqID_121443 SeqID_119233 subunitSeqID_119766 SeqID_120168 SeqID_123835 SeqID_124119 TFIIE_betaSeqID_121012 TFIIE beta subunit core domain AA_kinase SeqID_122430 Aminoacid kinase family Exo_endo_phos SeqID_122447 SeqID_123874Endonuclease/Exonuclease/phosphatase fa HLH SeqID_122206 SeqID_119868Helix-loop-helix DNA-binding SeqID_120631 domain Keratin_B2 SeqID_120481Keratin, high sulfur B2 protein TspO_MBR SeqID_122231 SeqID_122587TspO/MBR family SeqID_123338 SeqID_121431 SeqID_119499 SeqID_123778SeqID_123779 SeqID_124344 SeqID_121908 C1-set SeqID_119491 SeqID_120343Immunoglobulin C1-set domain SCO1-SenC SeqID_121873 SCO1/SenC T-boxSeqID_122717 SeqID_123320 T-box PSI SeqID_119510 Plexin repeat AAA_2SeqID_122219 SeqID_122276 ATPase family associated with SeqID_121378SeqID_119652 various cellul SeqID_119734 SeqID_121618 SeqID_120931SeqID_123871 SeqID_124113 SeqID_121923 DUF477 SeqID_122322 SeqID_124018Domain of unknown function (DUF477) AAA_3 SeqID_122493 SeqID_119176ATPase family associated with SeqID_119742 SeqID_119875 various cellulSeqID_120649 SeqID_121618 ABC_membrane SeqID_120493 ABC transportertransmembrane region fn3 SeqID_119158 SeqID_119225 Fibronectin type IIIdomain SeqID_119569 SeqID_120170 SeqID_120951 SeqID_121023 AAA_5SeqID_122219 SeqID_122493 ATPase family associated with SeqID_119176SeqID_119875 various cellul SeqID_120649 SeqID_120763 SeqID_120931SeqID_120936 SeqID_123871 Destabilase SeqID_122806 SeqID_124205Destabilase Glyco_transf_22 SeqID_123460 SeqID_120564 Alg9-likemannosyltransferase family Not3 SeqID_122203 SeqID_121162 Not1N-terminal domain, CCR4- Not complex com CDC50 SeqID_122266 SeqID_121335LEM3 (ligand-effect modulator 3) SeqID_120160 SeqID_123808 family/CDGlyco_transf_25 SeqID_121052 Glycosyltransferase family 25 (LPS bi PSSSeqID_119991 Phosphatidyl serine synthase PRP38 SeqID_123506SeqID_121713 PRP38 family UCR_14kD SeqID_122477 SeqID_121481Ubiquinol-cytochrome C reductase SeqID_121514 SeqID_123716 complex 14kBiopterin_H SeqID_122189 SeqID_122692 Biopterin-dependent aromaticSeqID_122958 SeqID_120567 amino acid h SeqID_123832 SeqID_124293Cofilin_ADF SeqID_122563 SeqID_122620 Cofilin/tropomyosin-type actin-SeqID_121306 SeqID_121419 binding pr SeqID_119918 SeqID_124332 MOZ_SASSeqID_123033 MOZ/SAS family SNase SeqID_121415 SeqID_120155Staphylococcal nuclease homologue Skp1_POZ SeqID_121262 SeqID_122339Skp1 family, tetramerisation SeqID_122672 SeqID_121318 domainSeqID_119994 SeqID_123642 Acyl_transf_3 SeqID_119415 Acyltransferasefamily Ribosomal_L10 SeqID_123472 SeqID_120199 Ribosomal protein L10SeqID_121812 HMA SeqID_122636 SeqID_123900 Heavy-metal-associated domainRibosomal_S3Ae SeqID_122253 SeqID_123202 Ribosomal S3Ae familySeqID_119906 SeqID_121492 SeqID_123962 Ribosomal_L11 SeqID_122466SeqID_122612 Ribosomal protein L11, RNA SeqID_122798 SeqID_120200binding do SeqID_121603 SeqID_120902 SeqID_123787 SeqID_121653SeqID_124218 eIF-1a SeqID_122566 SeqID_123523 Eukaryotic initiationfactor 1A SeqID_119774 SeqID_124173 Ribosomal_L13e SeqID_121985SeqID_121989 Ribosomal protein L13e SeqID_122082 SeqID_119425SeqID_123942 Ribosomal_L12 SeqID_122236 SeqID_121124 Ribosomal proteinL7/L12 C- SeqID_123921 terminal dom S10_plectin SeqID_122608SeqID_119224 Plectin/S10 domain SeqID_123551 Ribosomal_L14 SeqID_122293SeqID_123957 Ribosomal protein L14p/L23e DUF625 SeqID_120752 Protein ofunknown function (DUF625) Sec23_trunk SeqID_121381 SeqID_119647Sec23/Sec24 trunk domain ig SeqID_123529 SeqID_119196 Immunoglobulindomain SeqID_119491 SeqID_119569 SeqID_119713 SeqID_119816 SeqID_120343Ribosomal_L16 SeqID_122247 SeqID_122653 Ribosomal protein L16SeqID_123302 SeqID_124211 Ion_trans SeqID_119930 SeqID_120812 Iontransport protein NAF1 SeqID_119903 NAF1 domain Aa_trans SeqID_121971SeqID_119531 Transmembrane amino acid transporter protein APG6SeqID_122694 SeqID_121500 Autophagy protein Apg6 SEC-C SeqID_120562SEC-C motif KE2 SeqID_122599 SeqID_122773 KE2 family proteinSeqID_124174 SeqID_124230 SeqID_124235 Lyase_1 SeqID_120948 SeqID_121661Lyase SeqID_121786 Ran_BP1 SeqID_119729 SeqID_120815 RanBP1 domainPGM_PMM_IV SeqID_119474 SeqID_120700Phosphoglucomutase/phosphomannomutase, C-t BAH SeqID_123089 BAH domainUQ_con SeqID_121983 SeqID_122089 Ubiquitin-conjugating enzymeSeqID_122437 SeqID_122519 SeqID_122757 SeqID_123232 ENTH SeqID_122942SeqID_123389 ENTH domain DUF6 SeqID_119240 Integral membrane proteinDUF6 Ribosomal_L21e SeqID_122144 SeqID_123108 Ribosomal protein L21eSeqID_123268 SeqID_123293 SeqID_123298 SeqID_121434 SeqID_119745SeqID_123751 SeqID_124341 SeqID_121876 Cyclin_C SeqID_120722 Cyclin,C-terminal domain ADK SeqID_121282 SeqID_122317 Adenylate kinaseSeqID_119710 MAS20 SeqID_121991 SeqID_122410 MAS20 protein importreceptor SeqID_123936 TIG SeqID_122994 IPT/TIG domain DNA_pol_BSeqID_120301 DNA polymerase family B Ribosomal_L22 SeqID_121995SeqID_122261 Ribosomal protein L22p/L17e SeqID_122840 SeqID_123164SeqID_123257 SeqID_121516 SeqID_120038 SeqID_123674 Ribosomal_L14eSeqID_122701 SeqID_123548 Ribosomal protein L14 Ribosomal_L23SeqID_122038 SeqID_123046 Ribosomal protein L23 SeqID_119380SeqID_119408 SeqID_123694 SeqID_124231 SNF2_N SeqID_119766 SeqID_120168SNF2 family N-terminal domain SeqID_120323 Cgr1 SeqID_122506SeqID_123609 Cgr1 family Glutaredoxin SeqID_123129 Glutaredoxin PUASeqID_122707 SeqID_119847 PUA domain tRNA_m1G_MT_9 SeqID_122151SeqID_123209 tRNA m(1)G methyltransferase RNA_pol_Rpb2_3 SeqID_123375SeqID_121089 RNA polymerase Rpb2, domain 3 Ribosomal_L29 SeqID_122312SeqID_122610 Ribosomal L29 protein SeqID_123285 SeqID_119200SeqID_123638 SeqID_123687 SeqID_121725 RNA_pol_Rpb2_4 SeqID_123375SeqID_121089 RNA polymerase Rpb2, domain 4 zf-nanos SeqID_121133 NanosRNA binding domain RNA_pol_Rpb2_5 SeqID_122280 SeqID_119390 RNApolymerase Rpb2, domain 5 SeqID_121089 SeqID_124072 Peptidase_S8SeqID_121230 SeqID_123401 Subtilase family SeqID_119631 PUF SeqID_122211SeqID_122825 Pumilio-family RNA binding repeat SeqID_119161 SeqID_121389SeqID_123824 SeqID_121723 RNA_pol_Rpb2_6 SeqID_122280 SeqID_122552 RNApolymerase Rpb2, domain 6 SeqID_121330 SeqID_123182 SeqID_119390SeqID_123587 SeqID_123831 Cyclin_N SeqID_122428 SeqID_122670 Cyclin,N-terminal domain SeqID_122816 SeqID_121486 SeqID_120722 SeqID_121612SeqID_123768 SeqID_123905 Mod_r SeqID_123251 SeqID_123409 Modifier ofrudimentary (Mod(r)) SeqID_120718 SeqID_123566 protein RNA_pol_Rpb2_7SeqID_122552 RNA polymerase Rpb2, domain 7 Ribosomal_L7Ae SeqID_121269SeqID_122134 Ribosomal protein SeqID_122180 SeqID_122335L7Ae/L30e/S12e/Gadd4 SeqID_122367 SeqID_122404 SeqID_122604 SeqID_122718SeqID_123155 SeqID_123294 SeqID_123455 SeqID_119560 SeqID_123888SeqID_119859 SeqID_121504 SeqID_123633 SeqID_123660 SeqID_123698SeqID_123927 SeqID_123935 SeqID_123992 SeqID_124144 POLO_boxSeqID_119828 SeqID_121022 POLO box duplicated region Nucleoporin2SeqID_120026 Nucleoporin autopeptidase zf-BED SeqID_119515 BED zincfinger Ets SeqID_119450 Ets-domain Ribosomal_S2 SeqID_122439SeqID_122525 Ribosomal protein S2 SeqID_122923 SeqID_122995 SeqID_123127SeqID_123308 SeqID_123382 SeqID_120337 SeqID_120668 SeqID_123668SeqID_124044 Rcd1 SeqID_121351 SeqID_119153 Cell differentiation family,Rcd1- like Ribosomal_S4 SeqID_121533 SeqID_121546 Ribosomal proteinS4/S9 N- terminal domai GMC_oxred_C SeqID_122311 GMC oxidoreductaseRibosomal_S5 SeqID_122366 SeqID_123421 Ribosomal protein S5, N-terminalSeqID_123632 domai DUF1240 SeqID_120294 Protein of unknown function(DUF1240) Topoisom_I SeqID_123167 SeqID_121520 Eukaryotic DNAtopoisomerase I, SeqID_121660 SeqID_121884 catalytic Ribosomal_S6SeqID_122928 SeqID_119470 Ribosomal protein S6 DUF1241 SeqID_122262SeqID_120893 Protein of unknown function (DUF1241) Ribosomal_S7SeqID_122487 SeqID_122488 Ribosomal protein S7p/S5e SeqID_122733SeqID_123383 SeqID_124307 SeqID_123688 SeqID_124272 Ssl1 SeqID_122605SeqID_123911 Ssl1-like SeqID_124066 SeqID_121741 Ribosomal_S8SeqID_122779 SeqID_120839 Ribosomal protein S8 Nop52 SeqID_122283SeqID_120786 Nucleolar protein, Nop52 SeqID_121201 SeqID_124065Ribosomal_L22e SeqID_122791 SeqID_120882 Ribosomal L22e protein familyRibosomal_L30 SeqID_121300 SeqID_122251 Ribosomal protein L30p/L7eSeqID_122565 SeqID_123737 SeqID_123736 SeqID_121777 SeqID_121894SeqID_121896 AdoHcyase SeqID_122459 SeqID_123859S-adenosyl-L-homocysteine hydrolase Ribosomal_L15e SeqID_121969SeqID_122199 Ribosomal L15 SeqID_123299 SeqID_119488 SeqID_119941SeqID_120030 SeqID_123763 SeqID_123764 SeqID_124342 V-ATPase_CSeqID_120905 V-ATPase subunit C Proteasome SeqID_121422 SeqID_121276Proteasome A-type and B-type SeqID_121284 SeqID_121960 SeqID_121962SeqID_122045 SeqID_122055 SeqID_122190 SeqID_122396 SeqID_122413SeqID_122474 SeqID_122498 SeqID_121342 SeqID_122780 SeqID_122803SeqID_123336 SeqID_123362 SeqID_123419 SeqID_119278 SeqID_119909SeqID_121468 SeqID_120588 SeqID_120924 SeqID_123604 SeqID_123611SeqID_123646 SeqID_123681 SeqID_123825 SeqID_124136 SeqID_124227SeqID_121889 GMC_oxred_N SeqID_122311 SeqID_124037 GMC oxidoreductasePHF5 SeqID_122542 SeqID_120850 PHF5-like protein SeqID_123639DNA_gyraseB SeqID_120500 DNA gyrase B Cullin SeqID_122420 SeqID_119796Cullin family SeqID_119797 SeqID_120045 SeqID_123756 SeqID_120756SeqID_120773 SeqID_121682 SeqID_121780 SeqID_121941 DUF572 SeqID_119715SeqID_120330 Family of unknown function (DUF572) FAA_hydrolaseSeqID_122355 SeqID_124011 Fumarylacetoacetate (FAA) hydrolase famcNMP_binding SeqID_119563 SeqID_121604 Cyclic nucleotide-binding domainV-ATPase_G SeqID_122706 SeqID_123291 Vacuolar (H+)-ATPase G subunitSeqID_119894 SeqID_119927 SeqID_123872 V-ATPase_H SeqID_120190 V-ATPasesubunit H Epimerase SeqID_119815 SeqID_120284 NAD dependent SeqID_121143epimerase/dehydratase family Lipase_2 SeqID_119177 Lipase (class 2)Ribosomal_L39 SeqID_122520 SeqID_122799 Ribosomal L39 protein HCNGPSeqID_120697 SeqID_120698 HCNGP-like protein POP1 SeqID_123216Ribonucleases P/MRP protein subunit POP1 SMN SeqID_122580 SeqID_124165Survival motor neuron protein (SMN) ACPS SeqID_122149 SeqID_1241634′-phosphopantetheinyl transferase superfami Lamp SeqID_122690SeqID_123937 Lysosome-associated membrane glycoprotein (L FragX_IPSeqID_120422 Cytoplasmic Fragile-X interacting family Aminotran_1_2SeqID_122344 Aminotransferase class I and II ABC_tran SeqID_122402SeqID_119524 ABC transporter SeqID_119625 SeqID_120769 SeqID_123949 GRPSeqID_119747 SeqID_120570 Glycine rich protein family Vps54 SeqID_122919SeqID_119956 Vps54-like protein Aph-1 SeqID_122155 SeqID_123059 Aph-1protein SeqID_124013 Radical_SAM SeqID_120962 Radical SAM superfamilyJosephin SeqID_122683 SeqID_123774 Josephin SeqID_123928 EF1GSeqID_122057 SeqID_123454 Elongation factor 1 gamma, SeqID_120547SeqID_123557 conserved domain Monooxygenase SeqID_122333 SeqID_123771Monooxygenase EXS SeqID_120654 EXS family PCNA_C SeqID_122062SeqID_122819 Proliferating cell nuclear antigen, SeqID_120167 C-terminSad1_UNC SeqID_119813 SeqID_120250 Sad1/UNC-like C-terminal AMP-bindingSeqID_123152 SeqID_120233 AMP-binding enzyme DIM1 SeqID_123449SeqID_124285 Mitosis protein DIM1 ATP_bind_1 SeqID_122375 SeqID_122533Conserved hypothetical ATP SeqID_122872 SeqID_119782 binding proteinSeqID_123655 SeqID_123983 DUF652 SeqID_122800 SeqID_119259 Protein ofunknown function, SeqID_123553 SeqID_124216 DUF652 PCNA_N SeqID_122062SeqID_122819 Proliferating cell nuclear antigen, SeqID_122829SeqID_121061 N-termin SeqID_124283 DUF727 SeqID_122615 SeqID_119316Protein of unknown function SeqID_121702 SeqID_124046 (DUF727) Utp11SeqID_122295 SeqID_121156 Utp11 protein ThiF SeqID_121273 SeqID_122346ThiF family SeqID_119266 SeqID_121104 SeqID_124189 MMR_HSR1 SeqID_121288SeqID_122204 GTPase of unknown function SeqID_122360 SeqID_122375SeqID_122392 SeqID_122397 SeqID_122560 SeqID_122575 SeqID_122579SeqID_122600 SeqID_122619 SeqID_122657 SeqID_122671 SeqID_121329SeqID_122792 SeqID_122872 SeqID_123422 SeqID_119204 SeqID_121430SeqID_119311 SeqID_119332 SeqID_119352 SeqID_119367 SeqID_119492SeqID_119523 SeqID_119645 SeqID_119791 SeqID_120413 SeqID_120523SeqID_120766 SeqID_121613 SeqID_120900 SeqID_121038 SeqID_121093SeqID_123568 SeqID_123726 SeqID_123740 SeqID_123754 SeqID_121709SeqID_123960 SeqID_123950 SeqID_123988 SeqID_123991 SeqID_124180SeqID_121717 SeqID_121740 zf-C2H2 SeqID_122179 SeqID_122249 Zinc finger,C2H2 type SeqID_122267 SeqID_123033 SeqID_123091 SeqID_123110SeqID_123191 SeqID_119156 SeqID_119250 SeqID_119283 SeqID_119315SeqID_119326 SeqID_119455 SeqID_119515 SeqID_119638 SeqID_119674SeqID_119915 SeqID_119925 SeqID_120046 SeqID_120117 SeqID_120617SeqID_120707 SeqID_121133 SeqID_121172 SeqID_123906 SeqID_123914SeqID_124145 SeqID_121781 SeqID_121822 HEAT SeqID_122200 SeqID_122532HEAT repeat SeqID_121374 SeqID_121380 SeqID_122870 SeqID_119166SeqID_121402 SeqID_119389 SeqID_119427 SeqID_119658 SeqID_124126SeqID_119790 SeqID_119852 SeqID_119929 SeqID_121537 SeqID_119998SeqID_120055 SeqID_120163 SeqID_120538 SeqID_120549 SeqID_120581SeqID_120656 SeqID_121550 SeqID_121614 SeqID_121006 SeqID_121175SeqID_121201 SeqID_123600 SeqID_121708 PWI SeqID_122736 SeqID_122749 PWIdomain SeqID_120835 SeqID_124267 SeqID_124270 Syja_N SeqID_119649SeqID_120216 Sacl homology domain zf-Sec23_Sec24 SeqID_120593Sec23/Sec24 zinc finger Gcd10p SeqID_122163 SeqID_119505 Gcd10p familySeqID_124154 Gelsolin SeqID_119286 SeqID_121187 Gelsolin repeat FUN14SeqID_122589 FUN14 family UcrQ SeqID_122007 SeqID_122808 UcrQ familySeqID_119696 SeqID_124290 Ribosomal_L31e SeqID_122378 SeqID_123337Ribosomal protein L31e SeqID_123982 Ribosomal_L24e SeqID_122132SeqID_123616 Ribosomal protein L24e SeqID_123617 SeqID_124161Calreticulin SeqID_122265 SeqID_122837 Calreticulin family SeqID_119911SeqID_123897 eIF-5a SeqID_122554 SeqID_123204 Eukaryotic initiationfactor 5A SeqID_120378 SeqID_123582 hypusine, DN Pex14_N SeqID_122174SeqID_123766 Peroxisomal membrane anchor protein (Pex14p) DUF663SeqID_123123 SeqID_120018 Protein of unknown function (DUF663) UIMSeqID_122683 SeqID_123389 Ubiquitin interaction motif SeqID_123774 COX5ASeqID_122529 SeqID_120371 Cytochrome c oxidase subunit Va SeqID_123670COX5B SeqID_121281 SeqID_122395 Cytochrome c oxidase subunit VbSeqID_122479 SeqID_121428 SeqID_119688 SeqID_123743 SeqID_124156Ribosomal_L23eN SeqID_122038 SeqID_123046 Ribosomal protein L23,N-terminal SeqID_119380 SeqID_119408 dom SeqID_123694 PH SeqID_121274SeqID_123067 PH domain SeqID_119258 SeqID_119463 SeqID_120842SeqID_120929 SeqID_121878 GTP_EFTU_D2 SeqID_122391 SeqID_122657Elongation factor Tu domain 2 SeqID_121299 SeqID_122792 SeqID_123334SeqID_123345 SeqID_119311 SeqID_119352 SeqID_121538 SeqID_121551SeqID_120413 SeqID_120540 SeqID_123754 SeqID_123950 SeqID_123967SeqID_121805 SeqID_121825 SeqID_121826 Sas10_Utp3 SeqID_120054Sas10/Utp3 family Prp18 SeqID_123355 SeqID_120394 Prp18 domainGTP_EFTU_D3 SeqID_122729 SeqID_122792 Elongation factor Tu C-terminalSeqID_123334 SeqID_123345 domain SeqID_121538 SeqID_121551 SeqID_120413SeqID_123754 SeqID_121664 SeqID_121825 SeqID_121826 SeqID_121939 GATASeqID_121080 GATA zinc finger Spectrin SeqID_119246 SeqID_119482Spectrin repeat SeqID_119575 SeqID_120539 SeqID_120627 V-SNARESeqID_122066 SeqID_121095 Vesicle transport v-SNARE protein SeqID_123939Ribosomal_S5_C SeqID_122366 SeqID_123632 Ribosomal protein S5,C-terminal SeqID_121839 domai PX SeqID_120853 PX domain KID SeqID_120763KID repeat GSH_synth_ATP SeqID_122183 SeqID_121436 Eukaryoticglutathione synthase, SeqID_120021 SeqID_120101 ATP bi SeqID_120121SeqID_120409 SeqID_120622 SeqID_120666 SeqID_120719 SeqID_120782SeqID_120886 SeqID_120895 SeqID_121074 SeqID_123719 MCM SeqID_119742SeqID_119912 MCM2/3/5 family SeqID_120936 ETF_alpha SeqID_121607Electron transfer flavoprotein alpha subuni L51_S25_CI-B8 SeqID_122048SeqID_123231 Mitochondrial ribosomal protein SeqID_121114 SeqID_123645L51/S CBS SeqID_120735 SeqID_121213 CBS domain Ribosomal_L18eSeqID_122342 SeqID_122374 Eukaryotic ribosomal protein L18 SeqID_123713SeqID_123753 SeqID_121703 SeqID_124289 OTCace SeqID_119435Aspartate/omithine carbamoyltransterase, A GRAM SeqID_122481SeqID_123794 GRAM domain SeqID_121636 Rad21_Rec8 SeqID_122963 Conservedregion of Rad21/Rec8 like prot DUF676 SeqID_120421 Putative serineesterase (DUF676) Ribosomal_L18p SeqID_120495 SeqID_121687 RibosomalL18p/L5e family Metallophos SeqID_122191 SeqID_122213 Calcineurin-likephosphoesterase SeqID_122289 SeqID_123311 SeqID_123307 SeqID_121426SeqID_119337 SeqID_119448 SeqID_119502 SeqID_119572 SeqID_119659SeqID_119751 SeqID_119860 SeqID_121540 SeqID_120128 SeqID_120238SeqID_120434 SeqID_120484 SeqID_120553 SeqID_120710 SeqID_120825SeqID_120824 SeqID_120834 SeqID_120873 SeqID_121046 SeqID_121142SeqID_121188 SeqID_121700 SeqID_124029 SeqID_124030 SeqID_124061SeqID_121738 SeqID_121746 SeqID_121770 SeqID_121899 SeqID_121900SeqID_121915 HECT SeqID_121283 SeqID_121316 HECT-domain (ubiquitin-SeqID_119865 SeqID_120854 transferase) SeqID_121657 SeqID_121712Hormone_recep SeqID_120699 SeqID_121758 Ligand-binding domain of nuclearhormon NAC SeqID_122153 SeqID_122689 NAC domain SeqID_123502SeqID_120888 SeqID_123742 C1_1 SeqID_120222 SeqID_120526 Phorbolesters/diacylglycerol binding domain Calponin SeqID_121222 SeqID_122248Calponin family repeat SeqID_122936 SeqID_123005 SeqID_123171SeqID_123188 SeqID_123400 SeqID_120392 SeqID_120928 SeqID_120927SeqID_123804 SeqID_123842 SeqID_123853 SeqID_124187 RmaAD SeqID_122723SeqID_119487 Ribosomal RNA adenine SeqID_123996 dimethylase SPXSeqID_120654 SPX domain C1_3 SeqID_119379 SeqID_120217 C1-like domainSeqID_120788 GST_C SeqID_122057 SeqID_122407 Glutathione S-transferase,C- SeqID_122628 SeqID_122642 terminal domain SeqID_121383 SeqID_123011SeqID_121623 SeqID_121132 SeqID_123557 SeqID_123625 SeqID_123709SeqID_123708 SeqID_124298 SeqID_121838 Na_Ca_ex SeqID_120053Sodium/calcium exchanger protein B3_4 SeqID_120243 B3/4 domainSec23_helical SeqID_119286 Sec23/Sec24 helical domain Ribosomal_L40eSeqID_122033 SeqID_121408 Ribosomal L40e family SeqID_119857SeqID_123721 ICIn_channel SeqID_121998 SeqID_123531 Nucleotide-sensitivechloride conductanc Histone SeqID_121234 SeqID_122039 Core histoneH2A/H2B/H3/H4 SeqID_122046 SeqID_122096 SeqID_122119 SeqID_122455SeqID_122483 SeqID_122582 SeqID_122688 SeqID_122744 SeqID_122766SeqID_122771 SeqID_121314 SeqID_122884 SeqID_122955 SeqID_123003SeqID_123317 SeqID_123463 SeqID_123466 SeqID_123473 SeqID_121448SeqID_121446 SeqID_119330 SeqID_119493 SeqID_119512 SeqID_119521SeqID_119609 SeqID_119776 SeqID_119983 SeqID_121479 SeqID_121482SeqID_120068 SeqID_120077 SeqID_120480 SeqID_120534 SeqID_120790SeqID_121570 SeqID_121594 SeqID_121601 SeqID_120813 SeqID_121054SeqID_123744 SeqID_121645 SeqID_121663 SeqID_121675 SeqID_121683SeqID_121684 SeqID_121688 SeqID_121706 SeqID_121705 SeqID_124105SeqID_124164 SeqID_124258 SeqID_124262 SeqID_124276 SeqID_124314SeqID_121771 SeqID_121809 SeqID_121819 SeqID_121828 SeqID_121823SeqID_121905 Disintegrin SeqID_120481 Disintegrin 3HCDH SeqID_1204473-hydroxyacyl-CoA dehydrogenase, C-terminal NAP SeqID_122303SeqID_119982 Nucleosome assembly protein SeqID_124050 (NAP) TubulinSeqID_122269 SeqID_122332 Tubulin/FtsZ family, GTPase SeqID_122440SeqID_121336 domain SeqID_123013 SeqID_119381 SeqID_119720 SeqID_124114SeqID_121499 SeqID_121506 SeqID_121543 SeqID_120775 SeqID_121555SeqID_123545 SeqID_123884 SeqID_124069 SeqID_121761 SeqID_121820SeqID_121849 SeqID_121893 GST_N SeqID_122057 SeqID_122407 GlutathioneS-transferase, N- SeqID_122628 SeqID_122642 terminal domain SeqID_121383SeqID_123011 SeqID_123274 SeqID_119360 SeqID_121496 SeqID_120438SeqID_121623 SeqID_123557 SeqID_123625 SeqID_123709 SeqID_123708SeqID_124297 SeqID_121838 ETC_C1_NDUFA5 SeqID_122406 SeqID_123873 ETCcomplex I subunit conserved region 2-Hacid_dh SeqID_120275 D-isomerspecific 2-hydroxyacid dehydrogen Adenylsucc_synt SeqID_122272SeqID_121355 Adenylosuccinate synthetase SeqID_119162 SeqID_119650SeqID_120090 SeqID_120822 SeqID_124006 RTC SeqID_122518 SeqID_123729 RNA3′-terminal phosphate cyclase Ribosomal_L19e SeqID_122613 SeqID_122869Ribosomal protein L19e SeqID_123303 SeqID_120779 TRAPP_Bet3 SeqID_122875Transport protein particle (TRAPP) compone SMC_C SeqID_119625 SMCfamily, C-terminal domain CDP-OH_P_transf SeqID_123356 CDP-alcoholphosphatidyltransferase Frataxin_Cyay SeqID_122075 SeqID_123508Frataxin-like domain SeqID_120847 SeqID_124330 VHS SeqID_120912 VHSdomain DUF689 SeqID_122204 Protein of unknown function (DUF689) SMC_NSeqID_119887 SeqID_119966 RecF/RecN/SMC N terminal SeqID_120574SeqID_120702 domain PTPLA SeqID_120767 Protein tyrosine phosphatase-likeprotein, P PfkB SeqID_122553 SeqID_123583 pfkB family carbohydratekinase DSPc SeqID_122616 SeqID_122746 Dual specificity phosphatase,SeqID_124323 SeqID_119299 catalytic doma SeqID_119527 SeqID_120116SeqID_123836 SeqID_124056 Biotin_lipoyl SeqID_120976 Biotin-requiringenzyme Pkinase_C SeqID_122758 SeqID_122978 Protein kinase C terminaldomain SeqID_119365 SeqID_121625 SeqID_120796 DAD SeqID_119716 DADfamily Alpha_adaptin_C SeqID_120950 Alpha adaptin AP2, C-terminal domainRibosomal_L6e SeqID_122489 SeqID_122949 Ribosomal protein L6eSeqID_121011 SeqID_123683 S1 SeqID_122067 SeqID_123689 S1 RNA bindingdomain SeqID_123690 Oxidored_q6 SeqID_121980 SeqID_122154 NADHubiquinone oxidoreductase, SeqID_119667 SeqID_124082 20 Kd subExtensin_2 SeqID_120572 SeqID_120593 Extensin-like region SeqID_120713Gar1 SeqID_119747 Gar1 protein RNA binding region S4 SeqID_122623SeqID_121533 S4 domain SeqID_121546 SeqID_120098 SeqID_124001Bromodomain SeqID_119794 Bromodomain Laminin_N SeqID_119297 SeqID_119335Laminin N-terminal (Domain VI) CDI SeqID_122878 Cyclin-dependent kinaseinhibitor Mago_nashi SeqID_123521 SeqID_120904 Mago nashi protein SNF7SeqID_122586 SeqID_119719 SNF7 SeqID_119746 SeqID_120690 SeqID_121136SeqID_124098 ShTK SeqID_122581 SeqID_119601 ShTK domain SeqID_120015SeqID_120669 tRNA_anti SeqID_122633 SeqID_120602 OB-fold nucleic acidbinding SeqID_124327 domain Linker_histone SeqID_122931 SeqID_120058linker histone H1 and H5 family DAO SeqID_122333 SeqID_122338 FADdependent oxidoreductase SeqID_122411 SeqID_120662 SeqID_121620SeqID_123933 NDUF_B7 SeqID_120001 SeqID_122009 NADH-ubiquinoneoxidoreductase SeqID_122777 SeqID_123519 B18 subunit ( SeqID_123938SeqID_124080 Ribosomal_L34e SeqID_122631 SeqID_123111 Ribosomal proteinL34e SeqID_123332 SeqID_119760 SeqID_123896 DUF906 SeqID_119497 Domainof Unknown Function (DUF906) SPC12 SeqID_122753 SeqID_121566 Microsomalsignal peptidase 12 kDa SeqID_121590 SeqID_124248 subunit ( CLN3SeqID_123346 SeqID_120607 CLN3 protein SeqID_120681 RVT_1 SeqID_119187SeqID_119374 Reverse transcriptase (RNA- SeqID_119417 SeqID_119451dependent DNA pol SeqID_119461 SeqID_119462 SeqID_119508 SeqID_119566SeqID_119662 SeqID_119663 SeqID_119834 SeqID_119855 SeqID_119943SeqID_120004 SeqID_120103 SeqID_120138 SeqID_120352 SeqID_120433SeqID_120465 SeqID_120471 SeqID_120546 SeqID_120566 SeqID_120586SeqID_120687 SeqID_120716 SeqID_120852 SeqID_120935 SeqID_121043SeqID_121058 SeqID_121128 SeqID_121129 SeqID_121179 Gp_dh_C SeqID_121978SeqID_122147 Glyceraldehyde 3-phosphate SeqID_121379 SeqID_121400dehydrogenase, C- SeqID_121457 SeqID_119718 SeqID_123607 Ldi_recept_bSeqID_119985 SeqID_120019 Low-density lipoprotein receptor repeatF_actin_cap_B SeqID_122095 SeqID_123254 F-actin capping protein, betaSeqID_123555 subunit Methyltransf_8 SeqID_122377 SeqID_119368Hypothetical methyltransferase SeqID_123981 Mt_ATP-synt_B SeqID_122761SeqID_120653 Mitochondrial ATP synthase B SeqID_124245 SeqID_121789chain prec KAP_NTPase SeqID_120368 KAP family P-loop domainMt_ATP-synt_D SeqID_122314 SeqID_123912 ATP synthase D chain,mitochondrial (AT SAC3_GANP SeqID_121169 SAC3/GANP family Gp_dh_NSeqID_121986 SeqID_122098 Glyceraldehyde 3-phosphate SeqID_122147SeqID_123064 dehydrogenase, NA SeqID_123079 SeqID_123357 SeqID_123457SeqID_121379 SeqID_121457 SeqID_123840 SeqID_123607 SeqID_123870SeqID_124339 SeqID_121930 An_peroxidase SeqID_120092 SeqID_120241 Animalhaem peroxidase SeqID_120906 Ephrin SeqID_119824 Ephrin polyprenyl_syntSeqID_120635 Polyprenyl synthetase Neur_chan_memb SeqID_121005SeqID_121087 Neurotransmitter-gated ion- channel tra zf-NPL4SeqID_123465 SeqID_119522 NPL4 family, putative zinc binding region XAP5SeqID_122814 SeqID_122850 XAP5 protein SeqID_121167 RNA_pol SeqID_120255DNA-dependent RNA polymerase NMT_C SeqID_121407 SeqID_120382Myristoyl-CoA: protein N- myristoyltransferase Aldose_epim SeqID_122507SeqID_120204 Aldose 1-epimerase SeqID_123603 DUF841 SeqID_120243Eukaryotic protein of unknown function (DUF8 Mov34 SeqID_121290SeqID_121294 Mov34/MPN/PAD-1 family SeqID_122120 SeqID_122260SeqID_122735 SeqID_121344 SeqID_122844 SeqID_123535 SeqID_121382SeqID_121417 SeqID_119999 SeqID_120711 SeqID_123682 SeqID_121693SeqID_124040 SeqID_124087 SeqID_121736 NAD_binding_1 SeqID_122444SeqID_123823 Oxidoreductase NAD-binding domain Ribosomal_L28eSeqID_122648 SeqID_120682 Ribosomal L28e protein family SeqID_123720 LIMSeqID_122321 SeqID_123039 LIM domain SeqID_123158 SeqID_123185SeqID_119388 SeqID_119555 SeqID_119701 SeqID_120218 SeqID_123816SeqID_121638 SeqID_124168 SPC25 SeqID_122737 SeqID_124254 Microsomalsignal peptidase 25 kDa subunit ( WGR SeqID_121544 WGR domain STT3SeqID_120152 Oligosaccharyl transferase STT3 subun WH2 SeqID_119377 WH2motif 14-3-3 SeqID_122166 SeqID_122862 14-3-3 protein SeqID_123514SeqID_119268 SeqID_119536 SeqID_119605 Alpha_adaptinC2 SeqID_120950Adaptin C-terminal domain CbiA SeqID_122680 CobQ/CobB/MinD/ParAnucleotide binding do zf-MIZ SeqID_120096 MIZ zinc finger LipocalinSeqID_122101 SeqID_122172 Lipocalin/cytosolic fatty-acid SeqID_123222SeqID_119931 binding pr SeqID_120164 SeqID_123572 SeqID_123598SeqID_121698 DLIC SeqID_122981 Dynein light intermediate chain (DLIC)tRNA-synt_1c_C SeqID_120890 tRNA synthetases class I (E and Q), anBestrophin SeqID_119636 Bestrophin eIF-3_zeta SeqID_122685 SeqID_120237Eukaryotic translation initiation SeqID_123560 factor 3 Porin_3SeqID_122330 SeqID_121328 Eukaryotic porin SeqID_119190 SeqID_121557SeqID_121575 SeqID_123999 ARID SeqID_120573 ARID/BRIGHT DNA bindingdomain CybS SeqID_122786 SeqID_124202 CybS BCAS2 SeqID_122510SeqID_123556 Breast carcinoma amplified sequence 2 (BCAS2 Motile_SpermSeqID_122721 SeqID_123310 MSP (Major sperm protein) domain SeqID_119174SeqID_119175 SeqID_119281 SeqID_119604 SeqID_119641 SeqID_119724SeqID_120035 SeqID_120095 SeqID_120327 SeqID_120441 SeqID_120517SeqID_120563 SeqID_120632 SeqID_120633 SeqID_120910 SeqID_124175SeqID_121850 Transket_pyr SeqID_122651 SeqID_119346 Transketolase,pyridine binding SeqID_120804 SeqID_120844 domai SeqID_123846Fibrillarin SeqID_123301 SeqID_119795 Fibrillarin PABP SeqID_122526SeqID_121469 Poly-adenylate binding protein, SeqID_123820 SeqID_121869unique domai BRCT SeqID_121491 SeqID_120459 BRCA1 C Terminus (BRCT)SeqID_121621 domain Psf2 SeqID_119632 Partner of SLD five, PSF2tRNA-synt_1 SeqID_123431 SeqID_119678 tRNA synthetases class I (I, L, MSeqID_121502 and V) Psf3 SeqID_122668 SeqID_123586 Partner of SLD five,PSF3 tRNA-synt_2 SeqID_122301 SeqID_124123 tRNA synthetases class II (D,K SeqID_119833 SeqID_124052 and N) SeqID_121567 SeqID_121591SeqID_121045 NDK SeqID_121977 SeqID_122294 Nucleoside diphosphate kinaseSeqID_121345 SeqID_123201 SeqID_123452 SeqID_123451 SeqID_121388SeqID_123559 ATP-synt_DE_N SeqID_122751 SeqID_121401 ATP synthase,Delta/Epsilon SeqID_124264 SeqID_123752 chain, beta zf-C4 SeqID_122310SeqID_122617 Zinc-finger, C4 type (two domains) SeqID_122812SeqID_123080 SeqID_123445 SeqID_119666 SeqID_120699 SeqID_120820SeqID_124333 DIRP SeqID_119217 DIRP Ribosomal_L36e SeqID_122112SeqID_123552 Ribosomal protein L36e Filament SeqID_121260 SeqID_122400Intermediate filament protein SeqID_122546 SeqID_123386 SeqID_119327SeqID_121473 SeqID_120922 SeqID_123579 SeqID_123618 SeqID_123953TFIID_30kDa SeqID_120863 Transcription initiation factor TFIID 23-DUF926 SeqID_120692 Domain of Unknown Function (DUF926) DUF854SeqID_119289 SeqID_120777 Caenorhabditis elegans repeat of unknown funTPP_enzyme_M SeqID_120595 Thiamine pyrophosphate enzyme, central dPPI_Ypi1 SeqID_122389 SeqID_120052 Protein phosphatase inhibitorSeqID_123885 Myosin_head SeqID_119861 Myosin head (motor domain) MH1SeqID_122197 SeqID_123448 MH1 domain SeqID_121021 RWD SeqID_122537SeqID_120442 RWD domain DUF858 SeqID_122961 Eukaryotic protein ofunknown function (DUF8 3Beta_HSD SeqID_123272 SeqID_121396 3-betahydroxysteroid SeqID_119630 SeqID_121143 dehydrogenase/isomera BIRSeqID_123175 SeqID_119442 Inhibitor of Apoptosis domain SeqID_121665MTHFR SeqID_122188 SeqID_120829 Methylenetetrahydrofolate SeqID_124115reductase GYF SeqID_121994 GYF domain E1_dh SeqID_122564 SeqID_122651Dehydrogenase E1 component SeqID_123263 SeqID_123821 SeqID_123846Fork_head SeqID_120305 Fork head domain DUF1604 SeqID_119654 Protein ofunknown function (DUF1604) OST3_OST6 SeqID_123025 OST3/OST6 familyCadherin SeqID_119235 SeqID_119705 Cadherin domain PPTA SeqID_122418SeqID_123390 Protein prenyltransferase alpha SeqID_121057 SeqID_123924subunit repe GCV_H SeqID_122462 SeqID_123213 Glycine cleavage H-proteinSeqID_119530 Aldolase_II SeqID_121238 SeqID_122401 Class II Aldolase andAdducin N- SeqID_122499 SeqID_121462 terminal SeqID_121475 SeqID_123767SeqID_121689 SeqID_123952 SeqID_121934 AIG1 SeqID_119367 AIG1 familyRNase_PH SeqID_122716 SeqID_122887 3′ exoribonuclease family, domain 1SeqID_120679 SeqID_124199 Ribosomal_L18ae SeqID_122646 SeqID_122664Ribosomal L18ae protein family SeqID_122858 SeqID_123114 SeqID_123121SeqID_120878 SeqID_123624 SeqID_123886 SeqID_124236 SeqID_121791SeqID_121797 SeqID_121925 Nucleoside_tran SeqID_123441 SeqID_124095Nucleoside transporter Ribosomal_L37e SeqID_122591 SeqID_121170Ribosomal protein L37e SeqID_123963 Prefoldin SeqID_120211 SeqID_123679Prefoldin subunit Beta-lactamase SeqID_122907 SeqID_120056Beta-lactamase PC_rep SeqID_122969 Proteasome/cyclosome repeat DEADSeqID_122207 SeqID_122277 DEAD/DEAH box helicase SeqID_122318SeqID_122319 SeqID_122381 SeqID_123007 SeqID_123018 SeqID_119170SeqID_121414 SeqID_121443 SeqID_119233 SeqID_119256 SeqID_119300SeqID_119324 SeqID_119657 SeqID_119712 SeqID_119766 SeqID_120168SeqID_120342 SeqID_120509 SeqID_120575 SeqID_121032 SeqID_121200SeqID_123835 SeqID_123976 SeqID_124021 SeqID_124074 SeqID_124119 SURF4SeqID_120816 SURF4 family NCD3G SeqID_120999 Nine Cysteines Domain offamily 3 GPCR SURF6 SeqID_122863 Surfeit locus protein 6 Sec10SeqID_120206 Exocyst complex component Sec10 Oxidored_molyb SeqID_122426SeqID_123908 Oxidoreductase molybdopterin binding d Cation_effluxSeqID_119547 SeqID_119854 Cation efflux family HisKA_2 SeqID_121848Histidine kinase RNA_pol_Rpb5_C SeqID_122441 SeqID_123528 RNA polymeraseRpb5, C-terminal SeqID_123883 domain dUTPase SeqID_120762 dUTPaseCalx-beta SeqID_120919 Calx-beta domain FA_desaturase SeqID_122220SeqID_122258 Fatty acid desaturase BRF1 SeqID_119280 Brf1-likeTBP-binding domain W2 SeqID_119265 SeqID_121487eIF4-gamma/eIF5/eIF2-epsilon PIP5K SeqID_123028Phosphatidylinositol-4-phosphate 5-Kinase Ribosomal_L35Ae SeqID_122114SeqID_122827 Ribosomal protein L35Ae SeqID_123435 SeqID_123889SeqID_124288 RTC_insert SeqID_122518 SeqID_120776 RNA 3′-terminalphosphate cyclase SeqID_123973 (RTC), i SKIP_SNW SeqID_121033 SKIP/SNWdomain PAP_assoc SeqID_122124 SeqID_123088 PAP/25A associated domainSeqID_120195 DNA_pol_E_B SeqID_122568 SeqID_124351 DNA polymeraseepsilon subunit B RNA_pol_Rpb5_N SeqID_122441 SeqID_123137 RNApolymerase Rpb5, N-terminal SeqID_123528 SeqID_119977 domainSeqID_123883 Vicilin_N SeqID_121108 Vicilin N terminal region DEPSeqID_120527 Domain found in Dishevelled, Egl- 10, and Ple Cytochrom_CSeqID_122036 SeqID_123651 Cytochrome c Ribosomal_L38e SeqID_122040SeqID_119735 Ribosomal L38e protein family SeqID_123834 GRIM-19SeqID_123149 SeqID_119221 GRIM-19 protein SeqID_121885 DUF947SeqID_120569 Domain of unknown function (DUF947) DnaJ SeqID_121245SeqID_122232 DnaJ domain SeqID_122521 SeqID_122541 SeqID_122667SeqID_122722 SeqID_119277 SeqID_119288 SeqID_119644 SeqID_120879SeqID_120983 SeqID_121206 SeqID_123640 SeqID_123715 SeqID_123882SeqID_124112 SeqID_124311 G6PD_C SeqID_122743 SeqID_124271Glucose-6-phosphate dehydrogenase, C-termina PHO4 SeqID_122752SeqID_120810 Phosphate transporter family SeqID_124256 ReprolysinSeqID_120801 Reprolysin (M12B) family zinc metalloprote MIT SeqID_120407MIT domain LRR_1 SeqID_122148 SeqID_122601 Leucine Rich RepeatSeqID_119275 SeqID_119350 SeqID_120299 SeqID_120639 SeqID_121187SeqID_123576 SeqID_123887 Ribosomal_S21e SeqID_123512 SeqID_119578Ribosomal protein S21e SeqID_123570 tRNA-synt_1d_C SeqID_122724 DALRanticodon binding domain RNA_pol_A_bac SeqID_120230 SeqID_121039 RNApolymerase Rpb3/RpoA insert domain KOW SeqID_122127 SeqID_122516 KOWmotif SeqID_122554 SeqID_122701 SeqID_122710 SeqID_123295 SeqID_123458SeqID_119904 SeqID_123548 SeqID_123582 SeqID_123634 SeqID_123734 ECHSeqID_122281 SeqID_122454 Enoyl-CoA hydratase/isomerase SeqID_119372SeqID_123865 family SeqID_124071 SeqID_121747 IF_tail SeqID_121260SeqID_122400 Intermediate filament tail domain SeqID_120920 SeqID_123579PAN SeqID_120695 PAN domain zf-C3HC4 SeqID_122347 SeqID_122496 Zincfinger, C3HC4 type (RING SeqID_123384 SeqID_123511 finger) SeqID_119241SeqID_119357 SeqID_120644 SeqID_123692 SeqID_124075 WW SeqID_122257SeqID_122416 WW domain SeqID_119608 SeqID_120846 SeqID_120854SeqID_123538 PB1 SeqID_120217 PB1 domain NOG1 SeqID_119367 NucleolarGTP-binding protein 1 (NOG1) PAS SeqID_122206 PAS domain EI24SeqID_121718 Etoposide-induced protein 2.4 (EI24) MIF4G SeqID_120041MIF4G domain PI3_PI4_kinase SeqID_119291 SeqID_119922Phosphatidylinositol 3- and 4- SeqID_120521 SeqID_121582 kinaseSeqID_120853 SeqID_121831 PAZ SeqID_121243 SeqID_119347 PAZ domainSeqID_119944 SeqID_121539 SeqID_120129 SeqID_121552 SeqID_121086SeqID_121913 Cpn60_TCP1 SeqID_122080 SeqID_122245 TCP-1/cpn60 chaperoninfamily SeqID_123243 SeqID_123369 SeqID_119577 SeqID_119896 SeqID_121517SeqID_120191 SeqID_120292 SeqID_120613 SeqID_120876 SeqID_121091SeqID_123615 SeqID_123735 SeqID_121655 SeqID_123955 Tim17 SeqID_122097SeqID_124320 Tim17/Tim22/Tim23 family SeqID_121063 Ligase_CoASeqID_122725 SeqID_124060 CoA-ligase Trehalase SeqID_121616 TrehalasePQ-loop SeqID_122891 SeqID_123245 PQ loop repeat SeqID_123591 TTLSeqID_119185 Tubulin-tyrosine ligase family Myb_DNA-binding SeqID_119288SeqID_119345 Myb-like DNA-binding domain SeqID_120201 SeqID_121080SeqID_121197 Ribonuclease_3 SeqID_121044 RNase3 domain Ribophorin_ISeqID_121320 SeqID_120333 Ribophorin I SeqID_121666 CAS_CSE1SeqID_119961 CAS/CSE protein, C-terminus Pex2_Pex12 SeqID_122740SeqID_123980 Pex2/Pex12 amino terminal region Table 1 Legend: Column 1 -pfam name or designation Column 2 - gene family member listed by SEQ IDNO corresponding to amino acid sequence translation from vcDNA SEQ ID NOidentified in feature field of peptide sequence Column 3 - Proteinannotation based on BLASTP comparisons

In order to construct a dsRNA sequence, or concatamers or chimeras ofdsRNA sequences from various genes either within SCN, from other pestnucleotide sequences, or a combination thereof, nucleotide sequencescorresponding to the SCN genome sequences were BLASTed against knownvertebrate, soybean, and Rhizobium nucleotide sequences to firsteliminate contiguous sequences in SCN that matched sequences in knownvertebrate, soybean, and Rhozobium sequences that are at least about 21nucleotides in length. This redacted set of SCN sequences was thencompared to known nucleotide sequences in parasitic nematodes, insects,and fungi to identify sequences of substantial identity that could beuseful in constructing sequences that, when expressed as a dsRNAsequence, are capable of effecting gene suppression in SCN as well as inanother parasitic nematode, or insect, or fungal pest. The results ofcomparisons to other parasitic nematode sequences are shown in Table 2.

TABLE 2 Diverse Parasitic Nematode Nucleotide Coding Sequences MatchingH. glycines vcDNA sequences H. glycines Sequence¹ Position² GeneID³Position⁴ % identity⁵ Seq ID NO: 50950 21-138 gi|28916076 1-119 91% SeqID NO: 50950 14-41 gi|159473 34-61 100% Seq ID NO: 50950 14-41gi|2454547 227-200 100% Seq ID NO: 50950 14-41 gi|551594 194-221 100%Seq ID NO: 50950 14-41 gi|551595 669-696 100% Seq ID NO: 50950 14-41gi|18032254 34-61 100% Seq ID NO: 50950 18-41 gi|18477256 279-256 100%Seq ID NO: 50950 18-41 gi|18477260 325-302 100% Seq ID NO: 50950 18-41gi|18477262 634-611 100% Seq ID NO: 50950 14-35 gi|18477259 22-1 100%Seq ID NO: 50950 14-35 gi|18477261 22-1 100% Seq ID NO: 50950 14-35gi|37780968 22-1 100% Seq ID NO: 50973 88-108 gi|19267657 465-445 100%Seq ID NO: 51012 302-411 gi|18080245 178-69 84% Seq ID NO: 51132 182-360gi|32323955 5-183 96% Seq ID NO: 51132 184-359 gi|33139778 6-182 96% SeqID NO: 51132 182-351 gi|33140274 5-174 95% Seq ID NO: 51132 195-360gi|32324510 1-166 95% Seq ID NO: 51132 218-354 gi|33140264 1-137 95% SeqID NO: 51132 182-284 gi|32324219 5-107 94% Seq ID NO: 51164 117-137gi|22543645 32-12 100% Seq ID NO: 51169 1-236 gi|47118285 740-975 97%Seq ID NO: 51169 1-236 gi|47118286 753-988 97% Seq ID NO: 51169 1-219gi|16797830 731-950 97% Seq ID NO: 51169 1-219 gi|21885260 751-969 97%Seq ID NO: 51169 1-219 gi|26000759 727-945 97% Seq ID NO: 51169 1-219gi|31442322 728-946 97% Seq ID NO: 51169 1-219 gi|31442320 728-946 96%Seq ID NO: 51169 4-219 gi|16797831 732-947 96% Seq ID NO: 51169 1-219gi|38096133 730-948 94% Seq ID NO: 51169 1-33 gi|16797827 776-807 96%Seq ID NO: 51169 131-159 gi|16797844 862-890 96% Seq ID NO: 51169216-236 gi|48479719 1112-1132 100% Seq ID NO: 51184 16-47 gi|32324419271-302 100% Seq ID NO: 51378 184-459 gi|32325335 36-311 97% Seq ID NO:51378 504-626 gi|32325335 311-433 100% Seq ID NO: 51378 753-841gi|32325335 453-541 97% Seq ID NO: 51378 101-139 gi|32325335 1-39 100%Seq ID NO: 51378 676-702 gi|32325335 430-456 100% Seq ID NO: 51561402-476 gi|33139584 80-6 100% Seq ID NO: 51561 192-229 gi|33139584115-78 100% Seq ID NO: 51561 192-215 gi|6969910 34-11 100% Seq ID NO:51579 1-138 gi|33139914 209-346 97% Seq ID NO: 51823 55-187 gi|1808106080-212 86% Seq ID NO: 51824 3-97 gi|54548889 100-194 87% Seq ID NO:51824 3-97 gi|18089540 421-515 85% Seq ID NO: 51824 3-79 gi|18081060404-480 87% Seq ID NO: 51978 1-314 gi|33139071 85-397 96% Seq ID NO:51978 307-395 gi|33139071 406-494 97% Seq ID NO: 51978 406-452gi|33139071 505-551 97% Seq ID NO: 51978 406-475 gi|32325389 507-576 97%Seq ID NO: 51978 1-237 gi|33140521 87-323 94% Seq ID NO: 51991 239-260gi|30028357 64-85 100% Seq ID NO: 52051 1-78 gi|33140098 496-573 100%Seq ID NO: 52051 1-74 gi|32324423 496-569 100% Seq ID NO: 52051 1-61gi|33139933 499-559 100% Seq ID NO: 52051 1-58 gi|33139533 495-552 100%Seq ID NO: 52051 56-168 gi|18082418 136-248 85% Seq ID NO: 52162 163-185gi|40670360 123-101 100% Seq ID NO: 52240 388-410 gi|38513673 44-22 100%Seq ID NO: 52285 172-204 gi|18090560 293-325 100% Seq ID NO: 52293 1-133gi|54549200 31-163 91% Seq ID NO: 52293 11-133 gi|7144024 38-160 91% SeqID NO: 52293 30-133 gi|18090416 1-104 91% Seq ID NO: 52293 11-133gi|54545022 9-131 88% Seq ID NO: 52293 41-133 gi|54545591 1-93 91% SeqID NO: 52311 1-75 gi|30028941 235-309 96% Seq ID NO: 52311 1-75gi|33140204 235-309 96% Seq ID NO: 52432 1-102 gi|35504996 472-574 96%Seq ID NO: 52432 1-58 gi|30028727 523-580 100% Seq ID NO: 52432 1-54gi|30028246 523-576 100% Seq ID NO: 52432 1-39 gi|35505048 461-499 100%Seq ID NO: 52438 179-210 gi|18383056 1007-976 100% Seq ID NO: 525691-311 gi|32324631 49-359 95% Seq ID NO: 52596 77-218 gi|30028020 420-27997% Seq ID NO: 52642 79-324 gi|33139639 249-493 86% Seq ID NO: 5264266-218 gi|33140673 201-49 91% Seq ID NO: 52642 286-342 gi|32324211207-263 91% Seq ID NO: 52710 25-152 gi|33140374 1-128 94% Seq ID NO:52762 1-90 gi|34105813 1098-1009 98% Seq ID NO: 52762 1-90 gi|27077481096-1007 97% Seq ID NO: 52762 1-90 gi|51093962 1079-990 97% Seq ID NO:52762 1-90 gi|31376322 1040-952 95% Seq ID NO: 52762 1-90 gi|310742781105-1017 94% Seq ID NO: 52762 1-90 gi|31376323 1036-948 94% Seq ID NO:52762 1-90 gi|6983959 1059-971 94% Seq ID NO: 52762 3-90 gi|310742791103-1017 94% Seq ID NO: 52762 1-87 gi|34105806 1106-1020 94% Seq ID NO:52762 1-87 gi|30844180 1105-1019 93% Seq ID NO: 52762 1-90 gi|305258121092-1002 92% Seq ID NO: 52762 1-90 gi|31376331 1031-942 92% Seq ID NO:52762 1-90 gi|54873771 134-44 92% Seq ID NO: 52762 1-64 gi|341058101104-1041 98% Seq ID NO: 52831 1-47 gi|32324245 540-586 100% Seq ID NO:52831 1-40 gi|33139824 507-546 100% Seq ID NO: 52831 2-101 gi|18089712314-413 84% Seq ID NO: 52831 1-29 gi|33140223 507-535 100% Seq ID NO:52831 2-52 gi|7143499 320-371 86% Seq ID NO: 52831 1-23 gi|33140453507-529 100% Seq ID NO: 52854 153-174 gi|21378611 322-301 100% Seq IDNO: 52965 21-138 gi|28916076 1-119 91% Seq ID NO: 52965 14-41 gi|15947334-61 100% Seq ID NO: 52965 14-41 gi|2454547 227-200 100% Seq ID NO:52965 14-41 gi|551594 194-221 100% Seq ID NO: 52965 14-41 gi|551595669-696 100% Seq ID NO: 52965 14-41 gi|18032254 34-61 100% Seq ID NO:52965 18-41 gi|18477256 279-256 100% Seq ID NO: 52965 18-41 gi|18477260325-302 100% Seq ID NO: 52965 18-41 gi|18477262 634-611 100% Seq ID NO:52965 14-35 gi|18477259 22-1 100% Seq ID NO: 52965 14-35 gi|1847726122-1 100% Seq ID NO: 52965 14-35 gi|37780968 22-1 100% Seq ID NO: 52976617-707 gi|32325459 205-290 83% Seq ID NO: 52981 15-425 gi|3314013692-502 87% Seq ID NO: 53038 793-899 gi|18090533 269-375 89% Seq ID NO:53058 469-594 gi|18081946 320-195 90% Seq ID NO: 53058 172-199gi|1766021 225-252 96% Seq ID NO: 53058 172-199 gi|7641281 173-200 96%Seq ID NO: 53110 73-218 gi|33139131 99-244 89% Seq ID NO: 53110 79-218gi|32324211 1-140 88% Seq ID NO: 53110 68-218 gi|33140673 199-49 85% SeqID NO: 53110 86-218 gi|33139639 256-388 84% Seq ID NO: 53110 288-314gi|33139639 467-493 96% Seq ID NO: 53113 15-54 gi|28916076 1-40 97% SeqID NO: 53113 8-35 gi|159473 34-61 100% Seq ID NO: 53113 8-35 gi|2454547227-200 100% Seq ID NO: 53113 8-35 gi|551594 194-221 100% Seq ID NO:53113 8-35 gi|551595 669-696 100% Seq ID NO: 53113 8-35 gi|1803225434-61 100% Seq ID NO: 53113 12-35 gi|18477256 279-256 100% Seq ID NO:53113 12-35 gi|18477260 325-302 100% Seq ID NO: 53113 12-35 gi|18477262634-611 100% Seq ID NO: 53113 8-29 gi|18477259 22-1 100% Seq ID NO:53113 8-29 gi|18477261 22-1 100% Seq ID NO: 53113 8-29 gi|37780968 22-1100% Seq ID NO: 53345 33-68 gi|32324562 547-512 97% Seq ID NO: 53381122-150 gi|18382995 332-304 96% Seq ID NO: 53444 20-42 gi|8005191 86-64100% Seq ID NO: 53584 1-223 gi|33139862 41-263 99% Seq ID NO: 53584266-445 gi|33139862 262-441 99% Seq ID NO: 53584 1-129 gi|32325297437-565 100% Seq ID NO: 53584 601-702 gi|33139953 453-554 99% Seq ID NO:53584 1-146 gi|22544160 265-410 86% Seq ID NO: 53584 767-855 gi|7143567305-217 88% Seq ID NO: 53584 1-128 gi|30167496 229-356 82% Seq ID NO:53771 15-54 gi|28916076 1-40 97% Seq ID NO: 53771 8-35 gi|159473 34-61100% Seq ID NO: 53771 8-35 gi|2454547 227-200 100% Seq ID NO: 53771 8-35gi|551594 194-221 100% Seq ID NO: 53771 8-35 gi|551595 669-696 100% SeqID NO: 53771 8-35 gi|18032254 34-61 100% Seq ID NO: 53771 12-35gi|18477256 279-256 100% Seq ID NO: 53771 12-35 gi|18477260 325-302 100%Seq ID NO: 53771 12-35 gi|18477262 634-611 100% Seq ID NO: 53771 8-29gi|18477259 22-1 100% Seq ID NO: 53771 8-29 gi|18477261 22-1 100% Seq IDNO: 53771 8-29 gi|37780968 22-1 100% Seq ID NO: 53788 196-219gi|19265811 51-74 100% Seq ID NO: 53903 49-125 gi|51334373 320-396 90%Seq ID NO: 53903 281-309 gi|7143754 460-488 100% Seq ID NO: 53903281-309 gi|54546250 453-481 100% Seq ID NO: 53942 1-127 gi|33140348254-128 87% Seq ID NO: 53996 95-170 gi|18080046 260-334 90% Seq ID NO:54067 614-700 gi|18080124 72-159 84% Seq ID NO: 54067 630-658gi|18082691 129-157 96% Seq ID NO: 54223 1-32 gi|18082819 494-463 100%Seq ID NO: 54223 1-32 gi|32324738 389-420 100% Seq ID NO: 54223 1-32gi|54546964 145-176 100% Seq ID NO: 54228 9-154 gi|33139017 5-150 99%Seq ID NO: 54361 453-568 gi|33140744 29-141 84% Seq ID NO: 54394 96-242gi|33140673 195-49 90% Seq ID NO: 54394 110-281 gi|33139639 256-428 88%Seq ID NO: 54404 1-41 gi|18381736 808-848 90% Seq ID NO: 54404 19-63gi|18382098 510-554 89% Seq ID NO: 54404 19-41 gi|18381713 520-542 100%Seq ID NO: 54411 69-177 gi|18090579 65-176 86% Seq ID NO: 54431 127-151gi|18089765 337-361 100% Seq ID NO: 45816 323-510 gi|52128303 201-38881% Seq ID NO: 54533 295-336 gi|7144329 64-105 97% Seq ID NO: 5459817-91 gi|33139131 116-190 92% Seq ID NO: 54598 6-97 gi|32324211 1-92 87%Seq ID NO: 54650 161-185 gi|32183975 527-551 100% Seq ID NO: 54664 1-99gi|33139292 225-323 100% Seq ID NO: 54709 1-91 gi|6562543 742-832 98%Seq ID NO: 54709 22-59 gi|18382129 349-386 92% Seq ID NO: 54710 1-112gi|18090579 65-176 90% Seq ID NO: 54782 21-138 gi|28916076 1-119 91% SeqID NO: 54782 14-41 gi|159473 34-61 100% Seq ID NO: 54782 14-41gi|2454547 227-200 100% Seq ID NO: 54782 14-41 gi|551594 194-221 100%Seq ID NO: 54782 14-41 gi|551595 669-696 100% Seq ID NO: 54782 14-41gi|18032254 34-61 100% Seq ID NO: 54782 18-41 gi|18477256 279-256 100%Seq ID NO: 54782 18-41 gi|18477260 325-302 100% Seq ID NO: 54782 18-41gi|18477262 634-611 100% Seq ID NO: 54782 14-35 gi|18477259 22-1 100%Seq ID NO: 54782 14-35 gi|18477261 22-1 100% Seq ID NO: 54782 14-35gi|37780968 22-1 100% Seq ID NO: 54811 73-94 gi|7641297 281-302 100% SeqID NO: 54839 194-282 gi|33139570 594-506 100% Seq ID NO: 54839 198-282gi|30029553 597-513 100% Seq ID NO: 54839 131-282 gi|18079986 658-50788% Seq ID NO: 54839 218-282 gi|33139565 577-513 100% Seq ID NO: 54839132-220 gi|21493943 587-499 89% Seq ID NO: 54839 232-282 gi|32324861563-513 100% Seq ID NO: 54839 240-282 gi|32324744 548-506 97% Seq ID NO:54847 3-101 gi|28916076 176-79 89% Seq ID NO: 54865 54-202 gi|33140673199-51 88% Seq ID NO: 54865 61-202 gi|33139639 245-386 88% Seq ID NO:54865 59-199 gi|33139131 99-239 86% Seq ID NO: 45847 183-308 gi|18080348231-356 85% Seq ID NO: 45847 190-308 gi|7144294 198-316 85% Seq ID NO:54946 128-285 gi|54546846 415-257 85% Seq ID NO: 54981 83-126 gi|5515941028-1070 93% Seq ID NO: 54981 83-126 gi|551595 267-309 93% Seq ID NO:54999 328-498 gi|18382277 801-971 82% Seq ID NO: 55025 372-395gi|33138439 263-286 100% Seq ID NO: 55025 419-439 gi|33952838 448-428100% Seq ID NO: 55106 21-138 gi|28916076 1-119 91% Seq ID NO: 5510614-41 gi|159473 34-61 100% Seq ID NO: 55106 14-41 gi|2454547 227-200100% Seq ID NO: 55106 14-41 gi|551594 194-221 100% Seq ID NO: 5510614-41 gi|551595 669-696 100% Seq ID NO: 55106 14-41 gi|18032254 34-81100% Seq ID NO: 55106 18-41 gi|18477256 279-256 100% Seq ID NO: 5510618-41 gi|18477260 325-302 100% Seq ID NO: 55106 18-41 gi|18477262634-611 100% Seq ID NO: 55106 14-35 gi|18477259 22-1 100% Seq ID NO:55106 14-35 gi|18477261 22-1 100% Seq ID NO: 55106 14-35 gi|3778096822-1 100% Seq ID NO: 55108 1-93 gi|7143864 42-134 82% Seq ID NO: 5511735-57 gi|7144284 325-303 100% Seq ID NO: 55119 36-155 gi|28916076 140-2090% Seq ID NO: 55175 1-160 gi|5107411 189-30 100% Seq ID NO: 55175 1-160gi|2149587 189-30 95% Seq ID NO: 55175 47-160 gi|34105813 1719-1606 100%Seq ID NO: 55175 1-160 gi|2149585 189-30 92% Seq ID NO: 55175 1-160gi|2738785 190-30 92% Seq ID NO: 55175 1-160 gi|2738792 190-30 92% SeqID NO: 55175 1-160 gi|2738799 190-30 92% Seq ID NO: 55175 1-160gi|2738800 190-30 92% Seq ID NO: 55175 2-160 gi|2707748 1766-1607 90%Seq ID NO: 55175 1-80 gi|48479719 191-112 98% Seq ID NO: 55175 1-80gi|37674501 194-115 98% Seq ID NO: 55175 115-160 gi|37674501 75-30 95%Seq ID NO: 55175 1-78 gi|1147729 2525-2448 98% Seq ID NO: 55175 115-160gi|1147729 2413-2368 97% Seq ID NO: 55175 1-78 gi|2232021 87-10 98% SeqID NO: 55226 1-24 gi|33139696 554-577 100% Seq ID NO: 55245 1-25gi|32324959 40-64 96% Seq ID NO: 55392 173-276 gi|35504550 57-160 99%Seq ID NO: 55392 326-425 gi|35504550 164-263 97% Seq ID NO: 55392 1-66gi|35504862 10-74 89% Seq ID NO: 55425 57-187 gi|54545475 93-224 92% SeqID NO: 55425 231-321 gi|54545475 220-310 90% Seq ID NO: 55468 1-141gi|32325191 449-589 94% Seq ID NO: 55545 75-99 gi|32325459 266-290 100%Seq ID NO: 55553 1-116 gi|33139587 121-6 94% Seq ID NO: 55566 167-228gi|18083080 182-243 88% Seq ID NO: 55640 887-1044 gi|33139586 1-158 96%Seq ID NO: 55660 21-138 gi|28916076 1-119 91% Seq ID NO: 55660 14-41gi|159473 34-61 100% Seq ID NO: 55660 14-41 gi|2454547 227-200 100% SeqID NO: 55660 14-41 gi|551594 194-221 100% Seq ID NO: 55660 14-41gi|551595 669-696 100% Seq ID NO: 55660 14-41 gi|18032254 34-61 100% SeqID NO: 55660 18-41 gi|18477256 279-256 100% Seq ID NO: 55660 18-41gi|18477260 325-302 100% Seq ID NO: 55660 18-41 gi|18477262 634-611 100%Seq ID NO: 55660 14-35 gi|18477259 22-1 100% Seq ID NO: 55660 14-35gi|18477261 22-1 100% Seq ID NO: 55660 14-35 gi|37780968 22-1 100% SeqID NO: 55719 1-146 gi|18080802 354-209 97% Seq ID NO: 55719 1-131gi|7144157 137-7 97% Seq ID NO: 55719 1-130 gi|18088100 130-1 97% Seq IDNO: 55719 1-124 gi|54549688 124-1 96% Seq ID NO: 55719 1-118 gi|18089409118-1 97% Seq ID NO: 55719 1-117 gi|18090782 117-1 97% Seq ID NO: 5571925-146 gi|52128973 94-215 88% Seq ID NO: 55719 1-78 gi|18083086 537-61496% Seq ID NO: 55719 1-75 gi|18090403 75-1 96% Seq ID NO: 55719 25-121gi|52129301 319-415 86% Seq ID NO: 55719 100-146 gi|52129005 1-47 95%Seq ID NO: 55719 83-146 gi|9033884 441-503 90% Seq ID NO: 55719 83-146gi|15768080 331-393 90% Seq ID NO: 55719 71-146 gi|19383550 397-471 88%Seq ID NO: 55719 83-146 gi|39747546 344-406 90% Seq ID NO: 55719 100-146gi|39747568 266-312 95% Seq ID NO: 55719 100-146 gi|46984679 145-99 95%Seq ID NO: 55719 105-146 gi|15769768 391-432 97% Seq ID NO: 55719105-146 gi|40669510 255-296 97% Seq ID NO: 55719 100-140 gi|46986671175-135 97% Seq ID NO: 55719 108-146 gi|31326272 1-39 97% Seq ID NO:55719 105-146 gi|159682 379-420 92% Seq ID NO: 55759 302-322 gi|4698448489-109 100% Seq ID NO: 55808 1-74 gi|54545841 238-311 91% Seq ID NO:55827 140-161 gi|46983788 125-146 100% Seq ID NO: 55849 95-269gi|54548527 231-405 87% Seq ID NO: 55849 138-269 gi|54545989 2-133 88%Seq ID NO: 55931 140-268 gi|33139063 130-258 96% Seq ID NO: 55931 1-60gi|33139063 71-130 98% Seq ID NO: 56068 123-187 gi|21432594 122-186 86%Seq ID NO: 56211 1-101 gi|7144157 39-139 97% Seq ID NO: 56211 1-101gi|54549688 26-126 96% Seq ID NO: 56211 22-101 gi|18083086 614-535 96%Seq ID NO: 56211 25-101 gi|18090403 1-77 96% Seq ID NO: 56211 54-75gi|52129301 340-319 100% Seq ID NO: 56333 36-56 gi|33140432 264-284 100%Seq ID NO: 56402 639-718 gi|33139730 156-76 93% Seq ID NO: 56411 1-73gi|16797830 749-821 100% Seq ID NO: 56411 1-73 gi|16797832 747-819 100%Seq ID NO: 56411 1-73 gi|26000759 745-817 100% Seq ID NO: 56411 1-73gi|31442320 746-818 100% Seq ID NO: 56411 1-73 gi|31442322 746-818 98%Seq ID NO: 56411 8-73 gi|21885259 776-841 98% Seq ID NO: 45947 974-995gi|46988291 426-405 100% Seq ID NO: 56551 187-242 gi|18381363 1139-119491% Seq ID NO: 56640 1-65 gi|33140765 16-80 98% Seq ID NO: 56699 36-174gi|28916076 140-1 92% Seq ID NO: 56699 154-181 gi|159473 61-34 100% SeqID NO: 56699 154-181 gi|2454547 200-227 100% Seq ID NO: 56699 154-181gi|551594 221-194 100% Seq ID NO: 56699 154-181 gi|551595 696-669 100%Seq ID NO: 56699 154-181 gi|18032254 61-34 100% Seq ID NO: 56699 154-177gi|18477256 256-279 100% Seq ID NO: 56699 154-177 gi|18477260 302-325100% Seq ID NO: 56699 154-177 gi|18477262 611-634 100% Seq ID NO: 56699160-181 gi|18477259 1-22 100% Seq ID NO: 56699 160-181 gi|18477261 1-22100% Seq ID NO: 56699 160-181 gi|37780968 1-22 100% Seq ID NO: 56752348-369 gi|7235459 41-20 100% Seq ID NO: 45973 404-544 gi|54546305483-623 88% Seq ID NO: 45973 181-293 gi|54546305 345-457 83% Seq ID NO:45973 434-544 gi|54544886 269-379 90% Seq ID NO: 56872 764-956gi|33140201 149-341 99% Seq ID NO: 56872 569-719 gi|33140201 1-151 98%Seq ID NO: 56889 189-333 gi|54544976 245-389 89% Seq ID NO: 56927 75-226gi|33139131 99-250 87% Seq ID NO: 56927 81-221 gi|32324211 1-141 87% SeqID NO: 56972 1-101 gi|35504916 448-548 96% Seq ID NO: 56972 1-85gi|35504725 448-533 94% Seq ID NO: 57065 168-188 gi|33952655 9-29 100%Seq ID NO: 57065 168-188 gi|46422144 284-304 100% Seq ID NO: 5708462-214 gi|33140673 201-49 92% Seq ID NO: 57084 71-214 gi|33139639245-388 93% Seq ID NO: 57086 190-217 gi|54547991 157-184 96% Seq ID NO:57220 1-66 gi|37517244 1816-1881 96% Seq ID NO: 57351 5-196 gi|32323971353-544 98% Seq ID NO: 57416 412-434 gi|551595 853-875 100% Seq ID NO:57422 783-808 gi|46986223 315-290 100% Seq ID NO: 57468 1-221 gi|223910836-256 100% Seq ID NO: 57539 18-223 gi|32325381 184-389 92% Seq ID NO:57539 415-498 gi|32325381 398-481 97% Seq ID NO: 57539 174-223gi|33139814 8-57 100% Seq ID NO: 57713 253-552 gi|33139068 28-327 97%Seq ID NO: 57713 633-701 gi|33139068 334-402 98% Seq ID NO: 57713169-198 gi|33139068 1-30 100% Seq ID NO: 57713 750-877 gi|33139557441-568 93% Seq ID NO: 57713 176-198 gi|33139557 1-23 100% Seq ID NO:57713 750-892 gi|33140099 424-566 92% Seq ID NO: 57713 255-552gi|33139079 16-313 97% Seq ID NO: 57713 750-849 gi|33139079 434-533 94%Seq ID NO: 57713 1-198 gi|32324666 26-223 96% Seq ID NO: 57713 633-661gi|32324666 527-555 100% Seq ID NO: 57713 271-552 gi|33139103 1-282 97%Seq ID NO: 57713 287-554 gi|33139308 38-306 97% Seq ID NO: 57713 637-864gi|33139308 315-545 90% Seq ID NO: 57713 1232-1423 gi|33139521 162-35396% Seq ID NO: 57713 811-966 gi|33139521 1-159 89% Seq ID NO: 57713750-878 gi|33140185 112-240 93% Seq ID NO: 57713 636-701 gi|331401851-66 98% Seq ID NO: 57713 300-551 gi|32325329 51-302 90% Seq ID NO:57713 1229-1384 gi|32325562 408-563 99% Seq ID NO: 57713 750-966gi|32325562 189-408 89% Seq ID NO: 57713 598-701 gi|32325562 40-143 96%Seq ID NO: 57713 300-534 gi|32324015 367-601 90% Seq ID NO: 577131229-1374 gi|33140373 408-553 100% Seq ID NO: 57713 655-698 gi|32325327481-524 90% Seq ID NO: 57713 655-776 gi|32325443 481-599 83% Seq ID NO:57713 300-533 gi|33140304 225-458 89% Seq ID NO: 57713 300-517gi|33140764 367-584 89% Seq ID NO: 57713 1229-1347 gi|32324619 408-526100% Seq ID NO: 57713 300-490 gi|32325078 367-557 90% Seq ID NO: 577131254-1423 gi|33139825 258-427 90% Seq ID NO: 57713 300-453 gi|33139892410-563 90% Seq ID NO: 57713 300-452 gi|33140637 410-562 90% Seq ID NO:57713 1288-1423 gi|32324536 247-382 91% Seq ID NO: 57713 414-551gi|32324536 10-147 89% Seq ID NO: 57713 384-551 gi|33140589 1-168 88%Seq ID NO: 57713 393-551 gi|33140074 4-162 88% Seq ID NO: 57713 655-896gi|32325371 173-411 83% Seq ID NO: 57713 300-401 gi|32324162 476-577 92%Seq ID NO: 57713 428-551 gi|32324623 225-348 88% Seq ID NO: 57713300-375 gi|32324623 103-178 93% Seq ID NO: 57713 300-353 gi|33139035491-544 94% Seq ID NO: 57793 1-189 gi|33139897 347-535 91% Seq ID NO:57824 6-146 gi|32324211 1-141 88% Seq ID NO: 57824 17-151 gi|33139131116-250 88% Seq ID NO: 57824 1-145 gi|33140673 193-49 83% Seq ID NO:57824 13-145 gi|33139639 256-388 82% Seq ID NO: 57824 152-172gi|45563795 521-501 100% Seq ID NO: 57907 1213-1367 gi|18081696 127-28185% Seq ID NO: 57996 664-684 gi|39747573 93-113 100% Seq ID NO: 580014-65 gi|27541441 265-326 84% Seq ID NO: 46037 415-488 gi|21393234329-402 91% Seq ID NO: 46037 117-143 gi|21393234 148-174 100% Seq ID NO:46037 377-488 gi|18080246 261-372 85% Seq ID NO: 46037 418-488gi|18381808 299-229 90% Seq ID NO: 46037 427-488 gi|52128281 271-332 91%Seq ID NO: 58171 85-106 gi|22139607 133-112 100% Seq ID NO: 58223117-138 gi|16746280 49-70 100% Seq ID NO: 58237 123-269 gi|33139639245-388 89% Seq ID NO: 58237 114-269 gi|33140673 201-49 87% Seq ID NO:58237 121-246 gi|33139131 99-224 82% Seq ID NO: 58237 333-378gi|32324211 211-256 91% Seq ID NO: 58264 869-1010 gi|18082514 100-24190% Seq ID NO: 58264 1203-1298 gi|18082514 245-340 92% Seq ID NO: 58357136-238 gi|54549492 497-599 88% Seq ID NO: 58357 369-416 gi|54549492633-680 91% Seq ID NO: 58382 12-159 gi|18383012 1144-1291 84% Seq ID NO:58427 1-143 gi|16797830 62-204 97% Seq ID NO: 58427 1-143 gi|1679783162-202 97% Seq ID NO: 58427 1-143 gi|16797832 63-203 97% Seq ID NO:58427 1-143 gi|47118285 74-214 96% Seq ID NO: 58427 1-142 gi|2600076262-201 95% Seq ID NO: 58427 1-143 gi|31442322 62-202 95% Seq ID NO:58427 1-143 gi|38096133 62-202 93% Seq ID NO: 58438 319-443 gi|32324393180-307 96% Seq ID NO: 58438 1-83 gi|32324393 25-108 98% Seq ID NO:58438 154-195 gi|32324393 107-148 100% Seq ID NO: 58438 239-273gi|32324393 147-181 100% Seq ID NO: 58438 319-383 gi|33140257 413-47798% Seq ID NO: 58438 154-191 gi|33140257 345-382 100% Seq ID NO: 58478192-238 gi|18089268 22-68 89% Seq ID NO: 58670 81-148 gi|18090251452-519 88% Seq ID NO: 58722 346-432 gi|19266104 21-107 84% Seq ID NO:46083 1-119 gi|33139765 458-576 99% Seq ID NO: 46083 265-305 gi|52128426396-436 95% Seq ID NO: 46084 295-415 gi|33139765 576-456 99% Seq ID NO:46084 479-528 gi|33139765 454-405 100% Seq ID NO: 46084 109-149gi|52128426 436-396 95% Seq ID NO: 58765 150-170 gi|18081306 417-437100% Seq ID NO: 58768 2-66 gi|7143917 219-283 90% Seq ID NO: 58912483-676 gi|7144233 308-501 87% Seq ID NO: 58912 483-667 gi|7144339311-495 86% Seq ID NO: 58913 201-271 gi|52128794 102-172 90% Seq ID NO:58913 187-272 gi|46985091 319-234 85% Seq ID NO: 58913 85-121gi|18080422 365-401 94% Seq ID NO: 58913 249-272 gi|46985022 259-236100% Seq ID NO: 59048 23-135 gi|33139660 491-603 98% Seq ID NO: 59083221-245 gi|46987193 49-25 96% Seq ID NO: 59131 1-107 gi|18082168 3-10994% Seq ID NO: 59131 16-108 gi|51334509 37-129 88% Seq ID NO: 5913170-108 gi|52129549 138-176 89% Seq ID NO: 59132 1-56 gi|7641359 475-42091% Seq ID NO: 59132 1-59 gi|14085900 55-113 90% Seq ID NO: 59132 1-47gi|14086110 51-97 93% Seq ID NO: 59132 1-56 gi|15767660 65-120 89% SeqID NO: 59132 1-56 gi|18495580 501-446 89% Seq ID NO: 59132 19-59gi|39747527 61-101 92% Seq ID NO: 59201 21-138 gi|28916076 1-119 91% SeqID NO: 59201 14-41 gi|159473 34-61 100% Seq ID NO: 59201 14-41gi|2454547 227-200 100% Seq ID NO: 59201 14-41 gi|551594 194-221 100%Seq ID NO: 59201 14-41 gi|551595 669-696 100% Seq ID NO: 59201 14-41gi|18032254 34-61 100% Seq ID NO: 59201 18-41 gi|18477256 279-256 100%Seq ID NO: 59201 18-41 gi|18477260 325-302 100% Seq ID NO: 59201 18-41gi|18477262 634-611 100% Seq ID NO: 59201 14-35 gi|18477259 22-1 100%Seq ID NO: 59201 14-35 gi|18477261 22-1 100% Seq ID NO: 59201 14-35gi|37780968 22-1 100% Seq ID NO: 59268 131-256 gi|33139189 152-277 99%Seq ID NO: 59268 1-56 gi|33139189 96-151 98% Seq ID NO: 59269 3-85gi|7143543 482-564 88% Seq ID NO: 59311 1-81 gi|7144052 441-521 90% SeqID NO: 59484 1-118 gi|18080802 329-209 95% Seq ID NO: 59484 1-103gi|7144157 112-7 95% Seq ID NO: 59484 1-102 gi|18088100 105-1 95% Seq IDNO: 59484 1-96 gi|54549688 99-1 93% Seq ID NO: 59484 1-90 gi|1808940993-1 94% Seq ID NO: 59484 1-89 gi|18090782 92-1 94% Seq ID NO: 5948456-118 gi|9033884 441-503 90% Seq ID NO: 59484 56-118 gi|15768080331-393 90% Seq ID NO: 59484 56-118 gi|39747546 344-406 90% Seq ID NO:59484 77-118 gi|15769768 391-432 97% Seq ID NO: 59484 77-118 gi|19383550430-471 97% Seq ID NO: 59484 77-118 gi|39747568 271-312 97% Seq ID NO:59484 77-112 gi|46986671 170-135 100% Seq ID NO: 59484 80-118gi|31326272 1-39 97% Seq ID NO: 59484 77-118 gi|159682 379-420 92% SeqID NO: 59484 77-118 gi|15784370 177-218 92% Seq ID NO: 59530 1-129gi|33139697 192-63 97% Seq ID NO: 59558 15-54 gi|28916076 1-40 97% SeqID NO: 59558 8-35 gi|159473 34-61 100% Seq ID NO: 59558 8-35 gi|2454547227-200 100% Seq ID NO: 59558 8-35 gi|551594 194-221 100% Seq ID NO:59558 8-35 gi|551595 669-696 100% Seq ID NO: 59558 8-35 gi|1803225434-61 100% Seq ID NO: 59558 12-35 gi|18477256 279-256 100% Seq ID NO:59558 12-35 gi|18477260 325-302 100% Seq ID NO: 59558 12-35 gi|18477262634-611 100% Seq ID NO: 59558 8-29 gi|18477259 22-1 100% Seq ID NO:59558 8-29 gi|18477261 22-1 100% Seq ID NO: 59558 8-29 gi|37780968 22-1100% Seq ID NO: 59597 141-230 gi|33139639 297-386 85% Seq ID NO: 59597141-193 gi|33140673 140-88 90% Seq ID NO: 59652 7-89 gi|33140014 182-264100% Seq ID NO: 59901 94-114 gi|7798143 245-225 100% Seq ID NO: 599111-141 gi|2239106 1166-1307 98% Seq ID NO: 59911 1-141 gi|375172431166-1307 98% Seq ID NO: 59944 178-340 gi|33139639 257-419 89% Seq IDNO: 59944 159-310 gi|33140673 199-48 87% Seq ID NO: 60099 81-221gi|32324211 1-141 90% Seq ID NO: 60099 92-226 gi|33139131 116-250 89%Seq ID NO: 60099 305-349 gi|33139131 293-337 93% Seq ID NO: 60099 70-220gi|33140673 199-49 86% Seq ID NO: 60099 89-264 gi|33139639 257-429 84%Seq ID NO: 60155 1-67 gi|7144157 200-134 91% Seq ID NO: 60155 1-67gi|54549688 187-121 91% Seq ID NO: 60206 46-72 gi|159473 61-35 100% SeqID NO: 60206 46-72 gi|2454547 200-226 100% Seq ID NO: 60206 46-72gi|551594 221-195 100% Seq ID NO: 60206 46-72 gi|551595 696-670 100% SeqID NO: 60206 46-72 gi|18032254 61-35 100% Seq ID NO: 60206 46-69gi|18477256 256-279 100% Seq ID NO: 60206 46-69 gi|18477260 302-325 100%Seq ID NO: 60206 46-69 gi|18477262 611-634 100% Seq ID NO: 60206 52-72gi|18477259 1-21 100% Seq ID NO: 60206 52-72 gi|18477261 1-21 100% SeqID NO: 60206 52-72 gi|37780968 1-21 100% Seq ID NO: 60211 305-361gi|18081798 206-262 91% Seq ID NO: 60211 305-361 gi|54547765 204-260 89%Seq ID NO: 60211 52-77 gi|9829211 55-80 100% Seq ID NO: 60211 52-77gi|21652727 26-51 100% Seq ID NO: 60220 1-95 gi|33140348 161-255 96% SeqID NO: 60378 15-74 gi|28916076 1-60 93% Seq ID NO: 60378 8-35 gi|15947334-61 100% Seq ID NO: 60378 8-35 gi|2454547 227-200 100% Seq ID NO:60378 8-35 gi|551594 194-221 100% Seq ID NO: 60378 8-35 gi|551595669-696 100% Seq ID NO: 60378 8-35 gi|18032254 34-61 100% Seq ID NO:60378 12-35 gi|18477256 279-256 100% Seq ID NO: 60378 12-35 gi|18477260325-302 100% Seq ID NO: 60378 12-35 gi|18477262 634-611 100% Seq ID NO:60378 8-29 gi|18477259 22-1 100% Seq ID NO: 60378 8-29 gi|18477261 22-1100% Seq ID NO: 60378 8-29 gi|37780968 22-1 100% Seq ID NO: 60409273-620 gi|32324347 2-350 96% Seq ID NO: 60409 680-806 gi|32324347351-477 98% Seq ID NO: 60409 680-784 gi|33139657 351-455 98% Seq ID NO:60409 289-615 gi|33140241 13-338 97% Seq ID NO: 60409 677-806gi|33140241 341-470 98% Seq ID NO: 60409 272-619 gi|33139259 1-348 95%Seq ID NO: 60409 278-620 gi|33139000 12-355 96% Seq ID NO: 60409 272-615gi|33139478 1-339 96% Seq ID NO: 60409 268-620 gi|32324919 1-354 95% SeqID NO: 60409 273-609 gi|33139390 2-339 95% Seq ID NO: 60409 391-620gi|33140350 66-296 96% Seq ID NO: 60409 273-337 gi|33140350 2-66 93% SeqID NO: 60409 498-615 gi|33139683 1-118 98% Seq ID NO: 60495 1-88gi|5107411 569-656 100% Seq ID NO: 60495 1-88 gi|16797830 379-466 100%Seq ID NO: 60495 1-88 gi|16797831 377-464 98% Seq ID NO: 60495 1-88gi|16797832 378-465 98% Seq ID NO: 60495 1-88 gi|31442322 377-464 98%Seq ID NO: 60495 5-88 gi|26000760 381-464 98% Seq ID NO: 60495 1-88gi|38096135 377-465 95% Seq ID NO: 60495 12-86 gi|31442314 392-467 93%Seq ID NO: 60495 4-86 gi|38096138 386-468 90% Seq ID NO: 60495 3-86gi|16797838 399-482 89% Seq ID NO: 60532 243-263 gi|46986928 44-64 100%Seq ID NO: 60533 237-257 gi|46986928 64-44 100% Seq ID NO: 60534 122-248gi|33139568 363-489 99% Seq ID NO: 60534 1-75 gi|33139568 293-367 98%Seq ID NO: 60596 1-99 gi|33139131 122-220 89% Seq ID NO: 60596 29-92gi|33140673 143-80 87% Seq ID NO: 60782 367-405 gi|54547981 61-24 94%Seq ID NO: 60782 792-812 gi|19435901 24-4 100% Seq ID NO: 60783 218-256gi|54547981 24-61 94% Seq ID NO: 60788 36-165 gi|28916076 140-10 92% SeqID NO: 60644 26-217 gi|18080216 20-211 86% Seq ID NO: 60947 1-85gi|33140348 171-255 97% Seq ID NO: 61051 73-95 gi|39746742 647-625 100%Seq ID NO: 61179 139-160 gi|34316675 229-208 100% Seq ID NO: 6121776-324 gi|33139639 245-493 87% Seq ID NO: 61217 71-219 gi|33140673197-49 89% Seq ID NO: 61217 115-219 gi|32324211 36-140 88% Seq ID NO:61303 36-174 gi|28916076 140-1 92% Seq ID NO: 61303 154-181 gi|15947361-34 100% Seq ID NO: 61303 154-181 gi|2454547 200-227 100% Seq ID NO:61303 154-181 gi|551594 221-194 100% Seq ID NO: 61303 154-181 gi|551595696-669 100% Seq ID NO: 61303 154-181 gi|18032254 61-34 100% Seq ID NO:61303 154-177 gi|18477256 256-279 100% Seq ID NO: 61303 154-177gi|18477260 302-325 100% Seq ID NO: 61303 154-177 gi|18477262 611-634100% Seq ID NO: 61303 160-181 gi|18477259 1-22 100% Seq ID NO: 61303160-181 gi|18477261 1-22 100% Seq ID NO: 61303 160-181 gi|37780968 1-22100% Seq ID NO: 61351 4-140 gi|18080057 229-93 85% Seq ID NO: 613514-140 gi|54547707 233-97 84% Seq ID NO: 61352 186-312 gi|18080057303-429 90% Seq ID NO: 61352 1-134 gi|18080057 159-292 88% Seq ID NO:61352 1-139 gi|54547707 163-300 88% Seq ID NO: 61352 186-312 gi|54547707307-433 89% Seq ID NO: 61352 363-405 gi|54547707 443-485 90% Seq ID NO:61526 29-138 gi|28916076 9-119 91% Seq ID NO: 61553 31-90 gi|51237572280-339 86% Seq ID NO: 46246 1-165 gi|33139608 404-567 98% Seq ID NO:46246 1-62 gi|32323997 404-464 98% Seq ID NO: 61561 480-500 gi|5454675460-40 100% Seq ID NO: 61600 3-49 gi|32324211 94-140 93% Seq ID NO: 61643680-700 gi|23260226 280-260 100% Seq ID NO: 61686 1-73 gi|16797830749-821 100% Seq ID NO: 61686 1-73 gi|16797832 747-819 100% Seq ID NO:61686 1-73 gi|26000759 745-817 100% Seq ID NO: 61686 1-73 gi|31442320746-818 100% Seq ID NO: 61686 1-73 gi|31442322 746-818 98% Seq ID NO:61686 8-73 gi|21885259 776-841 98% Seq ID NO: 61688 97-117 gi|5212872561-81 100% Seq ID NO: 61719 701-721 gi|32185118 164-144 100% Seq ID NO:61810 13-290 gi|33140136 39-317 85% Seq ID NO: 61937 5-25 gi|19267937180-160 100% Seq ID NO: 61955 1-89 gi|33139270 471-383 100% Seq ID NO:61980 1-78 gi|33140673 126-49 92% Seq ID NO: 61980 19-84 gi|33139131185-250 92% Seq ID NO: 61980 250-309 gi|33139639 521-580 91% Seq ID NO:46268 777-960 gi|33140770 55-237 99% Seq ID NO: 46268 639-695gi|33140770 1-57 98% Seq ID NO: 62170 207-227 gi|46422740 369-349 100%Seq ID NO: 62176 200-225 gi|30028086 358-383 100% Seq ID NO: 62176198-218 gi|16005270 477-497 100% Seq ID NO: 46279 5-190 gi|35504456365-180 96% Seq ID NO: 46279 63-188 gi|7143616 133-258 91% Seq ID NO:46279 63-188 gi|18381740 382-507 91% Seq ID NO: 46279 63-184 gi|18381740131-252 91% Seq ID NO: 46279 76-188 gi|18381923 603-491 92% Seq ID NO:46279 73-188 gi|21493480 425-310 85% Seq ID NO: 62414 1-128 gi|32323987182-309 100% Seq ID NO: 62414 173-246 gi|32323987 309-382 100% Seq IDNO: 62414 173-222 gi|33140527 311-360 98% Seq ID NO: 62539 200-225gi|54547695 79-104 96% Seq ID NO: 62576 170-298 gi|33139556 85-213 94%Seq ID NO: 62576 42-126 gi|33139556 1-85 98% Seq ID NO: 46293 154-303gi|18080373 337-486 91% Seq ID NO: 62611 1-45 gi|33139645 197-241 97%Seq ID NO: 62636 254-285 gi|18382597 630-661 96% Seq ID NO: 46302262-359 gi|18381117 806-709 92% Seq ID NO: 46302 262-362 gi|7144103148-248 91% Seq ID NO: 46305 314-411 gi|54548221 329-232 88% Seq ID NO:62776 3-93 gi|33140348 248-158 89% Seq ID NO: 62899 1-140 gi|32325217127-267 96% Seq ID NO: 62904 154-306 gi|18082113 218-370 89% Seq ID NO:62904 158-306 gi|54549200 528-380 87% Seq ID NO: 62904 159-306gi|54545591 457-310 87% Seq ID NO: 62904 179-306 gi|7144024 503-377 87%Seq ID NO: 62986 504-558 gi|32324842 158-212 92% Seq ID NO: 63015 1-98gi|32324253 114-17 94% Seq ID NO: 63100 554-708 gi|32324459 81-235 95%Seq ID NO: 63107 36-174 gi|28916076 140-1 92% Seq ID NO: 63107 154-181gi|159473 61-34 100% Seq ID NO: 63107 154-181 gi|2454547 200-227 100%Seq ID NO: 63107 154-181 gi|551594 221-194 100% Seq ID NO: 63107 154-181gi|551595 696-669 100% Seq ID NO: 63107 154-181 gi|18032254 61-34 100%Seq ID NO: 63107 154-177 gi|18477256 256-279 100% Seq ID NO: 63107154-177 gi|18477260 302-325 100% Seq ID NO: 63107 154-177 gi|18477262611-634 100% Seq ID NO: 63107 160-181 gi|18477259 1-22 100% Seq ID NO:63107 160-181 gi|18477261 1-22 100% Seq ID NO: 63107 160-181 gi|377809681-22 100% Seq ID NO: 63189 13-47 gi|7143962 302-269 94% Seq ID NO: 63384576-772 gi|32325427 297-493 97% Seq ID NO: 63384 217-381 gi|3232542750-214 98% Seq ID NO: 63384 428-514 gi|32325427 213-299 100% Seq ID NO:63384 110-159 gi|32325427 1-50 94% Seq ID NO: 63384 629-772 gi|180899171-144 88% Seq ID NO: 63450 15-54 gi|28916076 1-40 97% Seq ID NO: 634508-35 gi|159473 34-61 100% Seq ID NO: 63450 8-35 gi|2454547 227-200 100%Seq ID NO: 63450 8-35 gi|551594 194-221 100% Seq ID NO: 63450 8-35gi|551595 669-696 100% Seq ID NO: 63450 8-35 gi|18032254 34-61 100% SeqID NO: 63450 12-35 gi|18477256 279-256 100% Seq ID NO: 63450 12-35gi|18477260 325-302 100% Seq ID NO: 63450 12-35 gi|18477262 634-611 100%Seq ID NO: 63450 8-29 gi|18477259 22-1 100% Seq ID NO: 63450 8-29gi|18477261 22-1 100% Seq ID NO: 63450 8-29 gi|37780968 22-1 100% Seq IDNO: 63508 16-41 gi|30166029 39-64 96% Seq ID NO: 63529 4-111 gi|33140348148-255 93% Seq ID NO: 46344 3-176 gi|54548367 26-199 85% Seq ID NO:46344 461-533 gi|54548367 299-371 90% Seq ID NO: 63626 1-138 gi|1809077620-157 88% Seq ID NO: 46358 286-423 gi|7144085 453-591 91% Seq ID NO:63905 1-73 gi|16797830 749-821 100% Seq ID NO: 63905 1-73 gi|16797832747-819 100% Seq ID NO: 63905 1-73 gi|26000759 745-817 100% Seq ID NO:63905 1-73 gi|31442320 746-818 100% Seq ID NO: 63905 1-73 gi|31442322746-818 98% Seq ID NO: 63905 8-73 gi|21885259 776-841 98% Seq ID NO:63930 36-103 gi|18080500 393-460 86% Seq ID NO: 64030 21-138 gi|289160761-119 91% Seq ID NO: 64030 14-41 gi|159473 34-61 100% Seq ID NO: 6403014-41 gi|2454547 227-200 100% Seq ID NO: 64030 14-41 gi|551594 194-221100% Seq ID NO: 64030 14-41 gi|551595 669-696 100% Seq ID NO: 6403014-41 gi|18032254 34-61 100% Seq ID NO: 64030 18-41 gi|18477256 279-256100% Seq ID NO: 64030 18-41 gi|18477260 325-302 100% Seq ID NO: 6403018-41 gi|18477262 634-611 100% Seq ID NO: 64030 14-35 gi|18477259 22-1100% Seq ID NO: 64030 14-35 gi|18477261 22-1 100% Seq ID NO: 64030 14-35gi|37780968 22-1 100% Seq ID NO: 64057 318-354 gi|33139730 82-119 97%Seq ID NO: 64082 276-364 gi|32325034 20-108 96% Seq ID NO: 64082 74-140gi|32324889 134-68 100% Seq ID NO: 64082 325-364 gi|33139589 1-40 97%Seq ID NO: 64082 74-147 gi|21393259 27-100 85% Seq ID NO: 64082 74-147gi|20498904 25-98 85% Seq ID NO: 64106 2-35 gi|20064584 1-34 97% Seq IDNO: 64146 339-442 gi|33140237 332-229 100% Seq ID NO: 64146 21-109gi|33140237 419-331 97% Seq ID NO: 64146 452-518 gi|33140237 219-153 98%Seq ID NO: 64146 580-617 gi|33140237 151-114 100% Seq ID NO: 64148457-691 gi|32325211 200-434 98% Seq ID NO: 64148 302-406 gi|3232521196-200 96% Seq ID NO: 64148 145-221 gi|32325211 24-99 96% Seq ID NO:64148 735-788 gi|32325211 434-487 100% Seq ID NO: 64148 897-948gi|32325211 540-591 100% Seq ID NO: 64148 798-842 gi|32325211 497-541100% Seq ID NO: 64148 75-101 gi|32325211 1-27 100% Seq ID NO: 64148897-1003 gi|32324168 477-583 99% Seq ID NO: 64148 186-221 gi|323241681-36 97% Seq ID NO: 64148 457-684 gi|33139272 319-546 98% Seq ID NO:64148 1-101 gi|33139272 46-146 99% Seq ID NO: 64148 897-996 gi|33139975485-584 99% Seq ID NO: 64148 178-221 gi|33139975 1-44 97% Seq ID NO:64181 21-138 gi|28916076 1-119 91% Seq ID NO: 64181 14-41 gi|15947334-61 100% Seq ID NO: 64181 14-41 gi|2454547 227-200 100% Seq ID NO:64181 14-41 gi|551594 194-221 100% Seq ID NO: 64181 14-41 gi|551595669-696 100% Seq ID NO: 64181 14-41 gi|18032254 34-61 100% Seq ID NO:64181 18-41 gi|18477256 279-256 100% Seq ID NO: 64181 18-41 gi|18477260325-302 100% Seq ID NO: 64181 18-41 gi|18477262 634-611 100% Seq ID NO:64181 14-35 gi|18477259 22-1 100% Seq ID NO: 64181 14-35 gi|1847726122-1 100% Seq ID NO: 64181 14-35 gi|37780968 22-1 100% Seq ID NO: 64254169-189 gi|18083118 116-136 100% Seq ID NO: 64254 169-189 gi|18382418749-769 100% Seq ID NO: 64268 399-518 gi|54546846 257-382 82% Seq ID NO:64305 240-260 gi|24467850 41-21 100% Seq ID NO: 64309 148-249gi|18088868 401-300 90% Seq ID NO: 46394 286-347 gi|46987704 361-300 85%Seq ID NO: 64348 295-324 gi|7922575 190-218 96% Seq ID NO: 64377 645-702gi|54544818 520-577 91% Seq ID NO: 64390 83-126 gi|551594 1028-1070 93%Seq ID NO: 64390 83-126 gi|551595 267-309 93% Seq ID NO: 64477 21-138gi|28916076 1-119 91% Seq ID NO: 64477 14-41 gi|159473 34-61 100% Seq IDNO: 64477 14-41 gi|2454547 227-200 100% Seq ID NO: 64477 14-41 gi|551594194-221 100% Seq ID NO: 64477 14-41 gi|551595 669-696 100% Seq ID NO:64477 14-41 gi|18032254 34-61 100% Seq ID NO: 64477 18-41 gi|18477256279-256 100% Seq ID NO: 64477 18-41 gi|18477260 325-302 100% Seq ID NO:64477 18-41 gi|18477262 634-611 100% Seq ID NO: 64477 14-35 gi|1847725922-1 100% Seq ID NO: 64477 14-35 gi|18477261 22-1 100% Seq ID NO: 6447714-35 gi|37780968 22-1 100% Seq ID NO: 64508 27-157 gi|32324692 9-13997% Seq ID NO: 64514 52-204 gi|32324211 369-521 85% Seq ID NO: 6457021-138 gi|28916076 1-119 91% Seq ID NO: 64570 14-41 gi|159473 34-61 100%Seq ID NO: 64570 14-41 gi|2454547 227-200 100% Seq ID NO: 64570 14-41gi|551594 194-221 100% Seq ID NO: 64570 14-41 gi|551595 669-696 100% SeqID NO: 64570 14-41 gi|18032254 34-61 100% Seq ID NO: 64570 18-41gi|18477256 279-256 100% Seq ID NO: 64570 18-41 gi|18477260 325-302 100%Seq ID NO: 64570 18-41 gi|18477262 634-611 100% Seq ID NO: 64570 14-35gi|18477259 22-1 100% Seq ID NO: 64570 14-35 gi|18477261 22-1 100% SeqID NO: 64570 14-35 gi|37780968 22-1 100% Seq ID NO: 64580 33-147gi|32325191 1-115 92% Seq ID NO: 64633 2-114 gi|33140530 1-113 100% SeqID NO: 64669 300-320 gi|52129537 617-637 100% Seq ID NO: 64692 1-57gi|30028941 55-111 98% Seq ID NO: 64692 1-57 gi|33140204 55-111 98% SeqID NO: 64694 95-147 gi|30028941 402-454 100% Seq ID NO: 64694 1-43gi|30028941 308-350 100% Seq ID NO: 64694 95-146 gi|30029072 395-446100% Seq ID NO: 64694 95-146 gi|33140204 402-453 100% Seq ID NO: 646941-43 gi|33140204 308-350 100% Seq ID NO: 64711 1-93 gi|33139524 233-14198% Seq ID NO: 64760 19-144 gi|54546846 257-383 84% Seq ID NO: 64762174-194 gi|9829276 404-424 100% Seq ID NO: 64762 174-194 gi|39747004520-540 100% Seq ID NO: 64780 1-118 gi|33140548 49-166 95% Seq ID NO:64780 151-225 gi|33140548 199-273 100% Seq ID NO: 64797 33-77gi|54549640 11-55 93% Seq ID NO: 64847 42-190 gi|33140348 108-255 92%Seq ID NO: 64847 1-70 gi|33140348 105-174 92% Seq ID NO: 64847 1-28gi|32324498 17-44 96% Seq ID NO: 64862 7-144 gi|7144222 169-306 92% SeqID NO: 64862 7-144 gi|54547280 139-276 92% Seq ID NO: 64862 32-149gi|51334182 168-285 88% Seq ID NO: 64862 27-146 gi|21494008 204-323 86%Seq ID NO: 64874 205-293 gi|18082074 129-217 83% Seq ID NO: 64931 21-41gi|46984838 273-293 100% Seq ID NO: 64935 1-84 gi|16797830 749-832 98%Seq ID NO: 64935 1-84 gi|16797832 747-830 98% Seq ID NO: 64935 1-84gi|26000759 745-828 98% Seq ID NO: 64935 1-84 gi|31442320 746-829 98%Seq ID NO: 64935 1-84 gi|31442322 746-829 97% Seq ID NO: 64935 1-77gi|16797831 747-823 98% Seq ID NO: 64935 8-84 gi|21885259 776-852 97%Seq ID NO: 65078 15-54 gi|28916076 1-40 97% Seq ID NO: 65084 1-166gi|35504916 448-612 94% Seq ID NO: 65084 1-85 gi|35504725 448-533 94%Seq ID NO: 65205 338-610 gi|33140653 131-403 100% Seq ID NO: 65205160-291 gi|33140653 1-132 97% Seq ID NO: 65205 663-717 gi|33140653401-455 100% Seq ID NO: 65205 725-746 gi|33140653 463-484 100% Seq IDNO: 65205 338-462 gi|32325169 130-254 100% Seq ID NO: 65205 161-291gi|32325169 1-131 97% Seq ID NO: 65205 500-610 gi|32325169 254-364 100%Seq ID NO: 65205 162-291 gi|33139964 1-130 98% Seq ID NO: 65261 1-243gi|33140806 9-255 95% Seq ID NO: 65261 62-234 gi|33139704 24-199 95% SeqID NO: 46445 556-665 gi|33139443 1-111 95% Seq ID NO: 65328 1-145gi|18090481 97-241 85% Seq ID NO: 46452 1-163 gi|7143710 57-220 89% SeqID NO: 46452 1-171 gi|54549870 42-213 89% Seq ID NO: 46452 16-163gi|18089601 5-153 90% Seq ID NO: 46452 29-163 gi|54547929 1-136 90% SeqID NO: 46452 1-171 gi|51237630 23-194 85% Seq ID NO: 65377 274-524gi|33140174 121-371 98% Seq ID NO: 65377 109-229 gi|33140174 1-121 99%Seq ID NO: 65377 578-680 gi|33140174 372-474 100% Seq ID NO: 65377274-527 gi|32324003 302-555 98% Seq ID NO: 65377 1-229 gi|3232400374-302 97% Seq ID NO: 65377 579-621 gi|32324003 554-596 97% Seq ID NO:65377 42-229 gi|32324933 13-200 97% Seq ID NO: 65377 579-680 gi|32324933452-553 99% Seq ID NO: 65397 90-152 gi|18081066 414-476 90% Seq ID NO:65474 2-53 gi|28916076 52-1 94% Seq ID NO: 65474 33-59 gi|159473 61-35100% Seq ID NO: 65474 33-59 gi|2454547 200-226 100% Seq ID NO: 6547433-59 gi|551594 221-195 100% Seq ID NO: 65474 33-59 gi|551595 696-670100% Seq ID NO: 65474 33-59 gi|18032254 61-35 100% Seq ID NO: 6547433-56 gi|18477256 256-279 100% Seq ID NO: 65474 33-56 gi|18477260302-325 100% Seq ID NO: 65474 33-56 gi|18477262 611-634 100% Seq ID NO:65474 39-59 gi|18477259 1-21 100% Seq ID NO: 65474 39-59 gi|184772611-21 100% Seq ID NO: 65474 39-59 gi|37780968 1-21 100% Seq ID NO: 655531-63 gi|33140188 109-171 98% Seq ID NO: 65553 2-50 gi|33140188 305-35390% Seq ID NO: 65558 14-173 gi|18383267 801-960 86% Seq ID NO: 65560480-623 gi|35504861 212-70 97% Seq ID NO: 65572 411-582 gi|33139585153-324 94% Seq ID NO: 65572 135-258 gi|33139585 1-124 96% Seq ID NO:65591 217-309 gi|32324909 25-117 93% Seq ID NO: 46459 203-386gi|18081711 111-294 86% Seq ID NO: 46459 1-35 gi|54549088 64-98 94% SeqID NO: 46459 14-35 gi|54549319 1-22 100% Seq ID NO: 65643 1-194gi|33140010 238-431 96% Seq ID NO: 65643 121-194 gi|33139314 1-74 97%Seq ID NO: 46465 159-238 gi|9829340 394-315 87% Seq ID NO: 65738 166-205gi|33140813 1-40 95% Seq ID NO: 65755 83-126 gi|551594 1028-1070 93% SeqID NO: 65755 83-126 gi|551595 267-309 93% Seq ID NO: 65813 1-121gi|28916076 122-1 91% Seq ID NO: 65813 101-126 gi|159473 61-36 100% SeqID NO: 65813 101-126 gi|2454547 200-225 100% Seq ID NO: 65813 101-126gi|551594 221-196 100% Seq ID NO: 65813 101-126 gi|551595 696-671 100%Seq ID NO: 65813 101-126 gi|18032254 61-36 100% Seq ID NO: 65813 101-124gi|18477256 256-279 100% Seq ID NO: 65813 101-124 gi|18477260 302-325100% Seq ID NO: 65813 101-124 gi|18477262 611-634 100% Seq ID NO: 6583180-203 gi|33140348 128-252 92% Seq ID NO: 65932 77-335 gi|33139639245-503 87% Seq ID NO: 65932 70-225 gi|33140673 199-44 88% Seq ID NO:65932 245-349 gi|32324211 165-269 86% Seq ID NO: 65932 293-327gi|33139131 293-327 94% Seq ID NO: 65966 1-215 gi|33140309 306-520 97%Seq ID NO: 66029 1-143 gi|16797830 62-204 98% Seq ID NO: 66029 1-143gi|16797831 62-202 97% Seq ID NO: 66029 1-143 gi|16797832 63-203 97% SeqID NO: 66029 1-143 gi|26000759 62-202 96% Seq ID NO: 66029 1-142gi|26000762 62-201 95% Seq ID NO: 66029 1-143 gi|31442322 62-202 95% SeqID NO: 66029 1-143 gi|38096133 62-202 93% Seq ID NO: 46479 143-293gi|18090185 96-246 90% Seq ID NO: 46479 412-474 gi|18090185 320-382 92%Seq ID NO: 66115 83-126 gi|551594 1028-1070 93% Seq ID NO: 66115 83-126gi|551595 267-309 93% Seq ID NO: 66138 21-168 gi|54546322 121-268 87%Seq ID NO: 66223 311-331 gi|32184515 225-245 100% Seq ID NO: 66245118-139 gi|54544778 203-224 100% Seq ID NO: 46493 113-266 gi|33139869326-476 92% Seq ID NO: 46493 115-269 gi|18080529 142-293 90% Seq ID NO:46493 113-240 gi|32325143 467-591 91% Seq ID NO: 46493 113-229gi|33139507 467-580 92% Seq ID NO: 46493 113-236 gi|33139658 467-587 91%Seq ID NO: 46493 113-217 gi|33139078 467-568 91% Seq ID NO: 46493171-286 gi|7143888 280-392 89% Seq ID NO: 46493 113-216 gi|33140145467-567 91% Seq ID NO: 46493 186-269 gi|18080754 174-257 92% Seq ID NO:46493 144-269 gi|52129866 585-708 86% Seq ID NO: 46493 192-286gi|18080844 1-95 89% Seq ID NO: 46493 144-254 gi|52129769 700-808 87%Seq ID NO: 46493 113-190 gi|32324728 467-544 92% Seq ID NO: 46493113-204 gi|33140539 467-555 90% Seq ID NO: 46493 118-239 gi|19264211454-572 82% Seq ID NO: 46493 211-245 gi|22140574 144-178 97% Seq ID NO:46493 220-254 gi|52129262 3-37 94% Seq ID NO: 46493 113-182 gi|15784199440-509 84% Seq ID NO: 46493 113-148 gi|15785306 458-492 91% Seq ID NO:66327 1-81 gi|35504440 192-272 95% Seq ID NO: 66372 317-509 gi|54546686284-476 85% Seq ID NO: 66372 454-509 gi|33139916 1-56 100% Seq ID NO:66454 23-43 gi|21285223 168-148 100% Seq ID NO: 66513 28-80 gi|54545135100-152 96% Seq ID NO: 66513 154-198 gi|54549414 410-454 93% Seq ID NO:66513 38-80 gi|54548968 1-43 95% Seq ID NO: 66611 1-107 gi|33140780433-539 98% Seq ID NO: 46522 1-101 gi|18382566 65-165 89% Seq ID NO:46522 6-101 gi|18089815 1-96 88% Seq ID NO: 46522 64-98 gi|2149353440-74 91% Seq ID NO: 66645 80-246 gi|32324211 1-167 86% Seq ID NO: 6664569-219 gi|33140673 199-49 83% Seq ID NO: 66645 87-219 gi|33139639256-388 83% Seq ID NO: 66691 1-141 gi|16797830 62-202 98% Seq ID NO:66691 1-141 gi|16797831 62-200 97% Seq ID NO: 66691 1-141 gi|1679783263-201 97% Seq ID NO: 66691 1-141 gi|26000759 62-200 96% Seq ID NO:66691 1-141 gi|31442322 62-200 95% Seq ID NO: 66691 1-141 gi|3809613362-200 93% Seq ID NO: 66692 1-160 gi|5107411 189-30 100% Seq ID NO:66692 1-160 gi|2149587 189-30 95% Seq ID NO: 66692 47-160 gi|341058131719-1606 100% Seq ID NO: 66692 1-160 gi|2149585 189-30 92% Seq ID NO:66692 1-160 gi|2738785 190-30 92% Seq ID NO: 66692 1-160 gi|2738792190-30 92% Seq ID NO: 66692 1-160 gi|2738799 190-30 92% Seq ID NO: 666921-160 gi|2738800 190-30 92% Seq ID NO: 66692 2-160 gi|2707748 1766-160790% Seq ID NO: 66692 1-80 gi|48479719 191-112 98% Seq ID NO: 66692 1-80gi|37674501 194-115 98% Seq ID NO: 66692 115-160 gi|37674501 75-30 95%Seq ID NO: 66692 1-78 gi|1147729 2525-2448 98% Seq ID NO: 66692 115-160gi|1147729 2413-2368 97% Seq ID NO: 66692 1-78 gi|2232021 87-10 98% SeqID NO: 66718 182-203 gi|15766531 77-56 100% Seq ID NO: 66870 256-452gi|18381486 666-469 84% Seq ID NO: 66870 407-452 gi|18381486 257-212 95%Seq ID NO: 67072 1-90 gi|33139333 250-161 96% Seq ID NO: 46553 13-97gi|54546152 237-321 94% Seq ID NO: 67199 23-130 gi|18089328 199-92 87%Seq ID NO: 46562 169-329 gi|18081141 23-183 90% Seq ID NO: 67282 129-150gi|45563616 32-53 100% Seq ID NO: 67295 19-157 gi|54546846 257-396 85%Seq ID NO: 46571 9-99 gi|18082482 10-100 92% Seq ID NO: 67414 1-211gi|35505285 2-212 99% Seq ID NO: 67414 251-397 gi|35505285 211-357 99%Seq ID NO: 67414 446-535 gi|35505285 357-446 87% Seq ID NO: 67414152-211 gi|7144110 176-235 91% Seq ID NO: 67465 21-138 gi|28916076 1-11991% Seq ID NO: 67465 14-41 gi|159473 34-61 100% Seq ID NO: 67465 14-41gi|2454547 227-200 100% Seq ID NO: 67465 14-41 gi|551594 194-221 100%Seq ID NO: 67465 14-41 gi|551595 669-696 100% Seq ID NO: 67465 14-41gi|18032254 34-61 100% Seq ID NO: 67465 18-41 gi|18477256 279-256 100%Seq ID NO: 67465 18-41 gi|18477260 325-302 100% Seq ID NO: 67465 18-41gi|18477262 634-611 100% Seq ID NO: 67465 14-35 gi|18477259 22-1 100%Seq ID NO: 67465 14-35 gi|18477261 22-1 100% Seq ID NO: 67465 14-35gi|37780968 22-1 100% Seq ID NO: 67477 36-173 gi|28916076 140-1 90% SeqID NO: 67477 157-180 gi|159473 57-34 100% Seq ID NO: 67477 157-180gi|2454547 204-227 100% Seq ID NO: 67477 157-180 gi|551594 217-194 100%Seq ID NO: 67477 157-180 gi|551595 692-669 100% Seq ID NO: 67477 157-180gi|18032254 57-34 100% Seq ID NO: 67477 159-180 gi|18477259 1-22 100%Seq ID NO: 67477 159-180 gi|18477261 1-22 100% Seq ID NO: 67477 159-180gi|37780968 1-22 100% Seq ID NO: 67682 1-101 gi|7144157 39-139 97% SeqID NO: 67682 1-101 gi|54549688 26-126 96% Seq ID NO: 67682 22-101gi|18083086 614-535 96% Seq ID NO: 67682 25-101 gi|18090403 1-77 96% SeqID NO: 67682 54-75 gi|52129301 340-319 100% Seq ID NO: 67725 80-268gi|33139635 1-188 96% Seq ID NO: 67725 324-389 gi|33139635 188-253 98%Seq ID NO: 67725 81-268 gi|32325323 1-188 93% Seq ID NO: 67762 5-150gi|18081743 87-232 83% Seq ID NO: 67856 66-470 gi|33140136 92-496 90%Seq ID NO: 67953 1-45 gi|33139122 150-194 97% Seq ID NO: 67954 469-489gi|19384388 444-464 100% Seq ID NO: 67956 88-241 gi|32324211 1-154 89%Seq ID NO: 67956 82-233 gi|33139131 99-250 86% Seq ID NO: 67956 312-349gi|33139131 293-330 94% Seq ID NO: 67956 77-227 gi|33140673 199-49 86%Seq ID NO: 67956 95-227 gi|33139639 256-388 85% Seq ID NO: 67994 273-323gi|33139445 12-62 96% Seq ID NO: 67996 1-157 gi|32325279 266-422 98% SeqID NO: 68073 21-187 gi|7619738 1078-1244 88% Seq ID NO: 68073 119-200gi|33139362 1-82 100% Seq ID NO: 68073 12-164 gi|1235973 712-864 88% SeqID NO: 68073 12-164 gi|54547532 371-523 87% Seq ID NO: 68073 82-164gi|54549430 381-463 91% Seq ID NO: 68073 92-114 gi|6382623 174-152 100%Seq ID NO: 68073 92-114 gi|23259469 284-306 100% Seq ID NO: 68317 36-174gi|28916076 140-1 90% Seq ID NO: 68317 161-181 gi|551595 689-669 100%Seq ID NO: 68317 161-181 gi|30168621 52-32 100% Seq ID NO: 46608 1-166gi|18081390 329-164 87% Seq ID NO: 46608 1-166 gi|54545329 221-386 86%Seq ID NO: 46608 15-116 gi|21056382 128-229 85% Seq ID NO: 68351 4-128gi|18083059 281-405 85% Seq ID NO: 68370 4-68 gi|33140590 143-79 98% SeqID NO: 68423 15-54 gi|28916076 1-40 97% Seq ID NO: 68423 8-35 gi|15947334-61 100% Seq ID NO: 68423 8-35 gi|2454547 227-200 100% Seq ID NO:68423 8-35 gi|551594 194-221 100% Seq ID NO: 68423 8-35 gi|551595669-696 100% Seq ID NO: 68423 8-35 gi|18032254 34-61 100% Seq ID NO:68423 12-35 gi|18477256 279-256 100% Seq ID NO: 68423 12-35 gi|18477260325-302 100% Seq ID NO: 68423 12-35 gi|18477262 634-611 100% Seq ID NO:68423 8-29 gi|18477259 22-1 100% Seq ID NO: 68423 8-29 gi|18477261 22-1100% Seq ID NO: 68423 8-29 gi|37780968 22-1 100% Seq ID NO: 68429 22-42gi|28916076 280-260 100% Seq ID NO: 68464 546-566 gi|46985670 328-348100% Seq ID NO: 68514 1-59 gi|32324676 60-118 96% Seq ID NO: 68541305-325 gi|19103835 404-424 100% Seq ID NO: 68615 21-138 gi|289160761-119 91% Seq ID NO: 68615 14-41 gi|159473 34-61 100% Seq ID NO: 6861514-41 gi|2454547 227-200 100% Seq ID NO: 68615 14-41 gi|551594 194-221100% Seq ID NO: 68615 14-41 gi|551595 669-696 100% Seq ID NO: 6861514-41 gi|18032254 34-61 100% Seq ID NO: 68615 18-41 gi|18477256 279-256100% Seq ID NO: 68615 18-41 gi|18477260 325-302 100% Seq ID NO: 6861518-41 gi|18477262 634-611 100% Seq ID NO: 68615 14-35 gi|18477259 22-1100% Seq ID NO: 68615 14-35 gi|18477261 22-1 100% Seq ID NO: 68615 14-35gi|37780968 22-1 100% Seq ID NO: 68654 317-337 gi|30165848 173-193 100%Seq ID NO: 68761 1-197 gi|33139403 83-280 89% Seq ID NO: 68761 515-623gi|33139403 421-528 85% Seq ID NO: 68761 45-157 gi|54546450 388-500 85%Seq ID NO: 68783 328-492 gi|33140476 83-247 99% Seq ID NO: 68783 129-212gi|33140476 1-84 100% Seq ID NO: 68783 653-732 gi|33140476 245-324 98%Seq ID NO: 68833 21-138 gi|28916076 1-119 91% Seq ID NO: 68833 14-41gi|159473 34-61 100% Seq ID NO: 68833 14-41 gi|2454547 227-200 100% SeqID NO: 68833 14-41 gi|551594 194-221 100% Seq ID NO: 68833 14-41gi|551595 669-696 100% Seq ID NO: 68833 14-41 gi|18032254 34-61 100% SeqID NO: 68833 18-41 gi|18477256 279-256 100% Seq ID NO: 68833 18-41gi|18477260 325-302 100% Seq ID NO: 68833 18-41 gi|18477262 634-611 100%Seq ID NO: 68833 14-35 gi|18477259 22-1 100% Seq ID NO: 68833 14-35gi|18477261 22-1 100% Seq ID NO: 68833 14-35 gi|37780968 22-1 100% SeqID NO: 46630 218-408 gi|33139957 1-193 98% Seq ID NO: 46630 486-631gi|33139957 192-337 98% Seq ID NO: 46630 659-791 gi|33139957 325-457 98%Seq ID NO: 46630 838-944 gi|33139957 452-558 96% Seq ID NO: 46630218-406 gi|33139361 1-191 97% Seq ID NO: 46630 487-631 gi|33139361193-337 98% Seq ID NO: 46630 838-887 gi|33139361 452-501 96% Seq ID NO:69016 185-264 gi|18383042 974-895 85% Seq ID NO: 69016 230-264gi|18383054 1411-1445 97% Seq ID NO: 69046 16-120 gi|28916076 252-35288% Seq ID NO: 69089 15-54 gi|28916076 1-40 97% Seq ID NO: 69089 8-35gi|159473 34-61 100% Seq ID NO: 69089 8-35 gi|2454547 227-200 100% SeqID NO: 69089 8-35 gi|551594 194-221 100% Seq ID NO: 69089 8-35 gi|551595669-696 100% Seq ID NO: 69089 8-35 gi|18032254 34-61 100% Seq ID NO:69089 12-35 gi|18477256 279-256 100% Seq ID NO: 69089 12-35 gi|18477260325-302 100% Seq ID NO: 69089 12-35 gi|18477262 634-611 100% Seq ID NO:69089 8-29 gi|18477259 22-1 100% Seq ID NO: 69089 8-29 gi|18477261 22-1100% Seq ID NO: 69089 8-29 gi|37780968 22-1 100% Seq ID NO: 69170 15-54gi|28916076 1-40 97% Seq ID NO: 69170 8-35 gi|159473 34-61 100% Seq IDNO: 69170 8-35 gi|2454547 227-200 100% Seq ID NO: 69170 8-35 gi|551594194-221 100% Seq ID NO: 69170 8-35 gi|551595 669-696 100% Seq ID NO:69170 8-35 gi|18032254 34-61 100% Seq ID NO: 69170 12-35 gi|18477256279-256 100% Seq ID NO: 69170 12-35 gi|18477260 325-302 100% Seq ID NO:69170 12-35 gi|18477262 634-611 100% Seq ID NO: 69170 8-29 gi|1847725922-1 100% Seq ID NO: 69170 8-29 gi|18477261 22-1 100% Seq ID NO: 691708-29 gi|37780968 22-1 100% Seq ID NO: 69314 36-174 gi|28916076 140-1 92%Seq ID NO: 69314 154-181 gi|159473 61-34 100% Seq ID NO: 69314 154-181gi|2454547 200-227 100% Seq ID NO: 69314 154-181 gi|551594 221-194 100%Seq ID NO: 69314 154-181 gi|551595 696-669 100% Seq ID NO: 69314 154-181gi|18032254 61-34 100% Seq ID NO: 69314 154-177 gi|18477256 256-279 100%Seq ID NO: 69314 154-177 gi|18477260 302-325 100% Seq ID NO: 69314154-177 gi|18477262 611-634 100% Seq ID NO: 69314 160-181 gi|184772591-22 100% Seq ID NO: 69314 160-181 gi|18477261 1-22 100% Seq ID NO:69314 160-181 gi|37780968 1-22 100% Seq ID NO: 69324 1-369 gi|51093880722-352 84% Seq ID NO: 69324 1-369 gi|51093884 722-352 83% Seq ID NO:69324 1-252 gi|51093883 723-469 85% Seq ID NO: 69324 313-369 gi|51093883407-352 91% Seq ID NO: 69324 134-369 gi|51093882 587-352 84% Seq ID NO:69324 134-369 gi|54545658 621-387 83% Seq ID NO: 69327 1-128 gi|3313987474-201 90% Seq ID NO: 69351 1-124 gi|33140792 30-153 98% Seq ID NO:69351 43-124 gi|45643642 1-82 98% Seq ID NO: 69351 58-124 gi|456436461-67 92% Seq ID NO: 46651 3-78 gi|18081363 236-310 96% Seq ID NO: 466513-76 gi|54544882 242-315 95% Seq ID NO: 69410 1-141 gi|16797830 62-20298% Seq ID NO: 69410 1-141 gi|16797831 62-200 97% Seq ID NO: 69410 1-141gi|16797832 63-201 97% Seq ID NO: 69410 1-141 gi|26000759 62-200 96% SeqID NO: 69410 1-141 gi|31442322 62-200 95% Seq ID NO: 69410 1-141gi|38096133 62-200 93% Seq ID NO: 69440 221-245 gi|46987193 49-25 96%Seq ID NO: 69448 1-53 gi|18081057 88-140 94% Seq ID NO: 69449 42-186gi|35504946 49-193 97% Seq ID NO: 69463 1-156 gi|54548643 39-188 87% SeqID NO: 69463 21-156 gi|18089480 9-138 86% Seq ID NO: 69489 1-118gi|16797830 147-264 99% Seq ID NO: 69489 1-118 gi|26000761 147-262 95%Seq ID NO: 69489 1-118 gi|16797832 148-263 94% Seq ID NO: 69489 1-118gi|47118286 171-286 94% Seq ID NO: 69489 1-118 gi|31442322 147-262 94%Seq ID NO: 69489 1-118 gi|21885259 169-284 91% Seq ID NO: 69494 323-427gi|33139639 245-349 90% Seq ID NO: 69494 316-441 gi|33140673 199-74 86%Seq ID NO: 69501 113-190 gi|54549674 422-500 94% Seq ID NO: 69501 13-53gi|54549674 321-361 95% Seq ID NO: 69570 16-153 gi|7143498 58-196 85%Seq ID NO: 69592 1-136 gi|16797830 686-821 100% Seq ID NO: 69592 1-136gi|16797832 684-819 100% Seq ID NO: 69592 1-136 gi|31442320 683-818 100%Seq ID NO: 69592 1-136 gi|26000759 682-817 99% Seq ID NO: 69592 1-136gi|31442322 683-818 99% Seq ID NO: 69592 1-136 gi|38096133 685-820 97%Seq ID NO: 69592 1-120 gi|16797846 702-824 90% Seq ID NO: 69592 1-120gi|16797847 692-814 90% Seq ID NO: 69592 1-120 gi|14600264 704-826 90%Seq ID NO: 69592 1-80 gi|16797841 691-769 97% Seq ID NO: 69592 1-80gi|16797843 725-803 97% Seq ID NO: 69592 1-114 gi|16797848 691-807 90%Seq ID NO: 69592 1-80 gi|16797849 692-770 97% Seq ID NO: 69593 1-68gi|5107411 156-89 100% Seq ID NO: 69593 14-68 gi|34105813 1719-1665 100%Seq ID NO: 69593 1-51 gi|2149587 156-106 100% Seq ID NO: 69593 1-48gi|2738785 157-110 100% Seq ID NO: 69593 1-48 gi|2738792 157-110 100%Seq ID NO: 69593 1-48 gi|2738799 157-110 100% Seq ID NO: 69593 1-48gi|2738800 157-110 100% Seq ID NO: 69593 1-54 gi|2149585 156-103 96% SeqID NO: 69593 1-47 gi|31074278 1741-1695 97% Seq ID NO: 69593 1-47gi|31074279 1741-1695 97% Seq ID NO: 69593 1-47 gi|48479719 158-112 97%Seq ID NO: 69593 1-47 gi|37674501 161-115 97% Seq ID NO: 69593 1-45gi|30844179 1735-1691 97% Seq ID NO: 69716 1-121 gi|28916076 122-1 91%Seq ID NO: 69716 101-128 gi|159473 61-34 100% Seq ID NO: 69716 101-128gi|2454547 200-227 100% Seq ID NO: 69716 101-128 gi|551594 221-194 100%Seq ID NO: 69716 101-128 gi|551595 696-669 100% Seq ID NO: 69716 101-128gi|18032254 61-34 100% Seq ID NO: 69716 101-124 gi|18477256 256-279 100%Seq ID NO: 69716 101-124 gi|18477260 302-325 100% Seq ID NO: 69716101-124 gi|18477262 611-634 100% Seq ID NO: 69716 107-128 gi|184772591-22 100% Seq ID NO: 69716 107-128 gi|18477261 1-22 100% Seq ID NO:69716 107-128 gi|37780968 1-22 100% Seq ID NO: 69747 6-258 gi|51093884252-7 91% Seq ID NO: 69747 6-258 gi|18080808 295-50 89% Seq ID NO: 69747207-258 gi|18032252 2936-2885 92% Seq ID NO: 69747 219-252 gi|3004983762-29 97% Seq ID NO: 69747 219-252 gi|20064115 400-367 94% Seq ID NO:69747 219-251 gi|19267510 33-1 94% Seq ID NO: 69747 219-241 gi|1926836627-5 100% Seq ID NO: 69795 145-251 gi|52129366 202-308 85% Seq ID NO:69795 50-74 gi|52129366 161-185 100% Seq ID NO: 69821 1-101 gi|32324237442-541 98% Seq ID NO: 46679 448-820 gi|33139644 1-373 93% Seq ID NO:46679 1053-1213 gi|33139644 387-546 95% Seq ID NO: 69829 21-138gi|28916076 1-119 91% Seq ID NO: 69829 14-41 gi|159473 34-61 100% Seq IDNO: 69829 14-41 gi|2454547 227-200 100% Seq ID NO: 69829 14-41 gi|551594194-221 100% Seq ID NO: 69829 14-41 gi|551595 669-696 100% Seq ID NO:69829 14-41 gi|18032254 34-61 100% Seq ID NO: 69829 18-41 gi|18477256279-256 100% Seq ID NO: 69829 18-41 gi|18477260 325-302 100% Seq ID NO:69829 18-41 gi|18477262 634-611 100% Seq ID NO: 69829 14-35 gi|1847725922-1 100% Seq ID NO: 69829 14-35 gi|18477261 22-1 100% Seq ID NO: 6982914-35 gi|37780968 22-1 100% Seq ID NO: 69838 1-98 gi|18382603 1075-97889% Seq ID NO: 69838 25-98 gi|18382603 520-447 91% Seq ID NO: 6989321-138 gi|28916076 1-119 91% Seq ID NO: 69893 14-41 gi|159473 34-61 100%Seq ID NO: 69893 14-41 gi|2454547 227-200 100% Seq ID NO: 69893 14-41gi|551594 194-221 100% Seq ID NO: 69893 14-41 gi|551595 669-696 100% SeqID NO: 69893 14-41 gi|18032254 34-61 100% Seq ID NO: 69893 18-41gi|18477256 279-256 100% Seq ID NO: 69893 18-41 gi|18477260 325-302 100%Seq ID NO: 69893 18-41 gi|18477262 634-611 100% Seq ID NO: 69893 14-35gi|18477259 22-1 100% Seq ID NO: 69893 14-35 gi|18477261 22-1 100% SeqID NO: 69893 14-35 gi|37780968 22-1 100% Seq ID NO: 69930 1-65gi|14280572 7058-7122 100% Seq ID NO: 69972 1-247 gi|47118286 742-98897% Seq ID NO: 69972 1-247 gi|47118285 729-975 96% Seq ID NO: 699721-230 gi|16797830 720-950 96% Seq ID NO: 69972 1-230 gi|21885260 740-96996% Seq ID NO: 69972 1-230 gi|31442322 717-946 96% Seq ID NO: 699721-230 gi|26000759 716-945 96% Seq ID NO: 69972 1-230 gi|31442320 717-94696% Seq ID NO: 69972 1-230 gi|38096133 719-948 94% Seq ID NO: 69972 1-44gi|16797827 765-807 95% Seq ID NO: 69972 142-170 gi|16797844 862-890 96%Seq ID NO: 69972 227-247 gi|48479719 1112-1132 100% Seq ID NO: 699781-157 gi|33139812 208-364 98% Seq ID NO: 69998 1-79 gi|28916077 109-3187% Seq ID NO: 70006 213-380 gi|33139204 1-168 98% Seq ID NO: 7000696-177 gi|33139204 462-542 97% Seq ID NO: 70006 96-214 gi|33139268461-579 98% Seq ID NO: 70072 1-107 gi|18087923 4-110 87% Seq ID NO:70072 34-68 gi|51237696 40-74 91% Seq ID NO: 70096 21-138 gi|289160761-119 91% Seq ID NO: 70096 14-41 gi|159473 34-61 100% Seq ID NO: 7009614-41 gi|2454547 227-200 100% Seq ID NO: 70096 14-41 gi|551594 194-221100% Seq ID NO: 70096 14-41 gi|551595 669-696 100% Seq ID NO: 7009614-41 gi|18032254 34-61 100% Seq ID NO: 70096 18-41 gi|18477256 279-256100% Seq ID NO: 70096 18-41 gi|18477260 325-302 100% Seq ID NO: 7009618-41 gi|18477262 634-611 100% Seq ID NO: 70096 14-35 gi|18477259 22-1100% Seq ID NO: 70096 14-35 gi|18477261 22-1 100% Seq ID NO: 70096 14-35gi|37780968 22-1 100% Seq ID NO: 70127 759-900 gi|33140064 109-250 98%Seq ID NO: 70127 20-127 gi|33140064 1-108 98% Seq ID NO: 70140 353-392gi|32324842 178-217 97% Seq ID NO: 70163 61-147 gi|32324657 1-89 95% SeqID NO: 70163 59-153 gi|32325004 1-95 92% Seq ID NO: 70163 61-153gi|32324353 1-93 92% Seq ID NO: 70163 59-147 gi|33140382 1-89 93% Seq IDNO: 70253 17-151 gi|33139131 116-250 88% Seq ID NO: 70253 6-145gi|32324211 1-140 88% Seq ID NO: 70253 464-572 gi|32324211 394-502 87%Seq ID NO: 70253 13-231 gi|33139639 256-471 81% Seq ID NO: 70253 1-145gi|33140673 193-49 83% Seq ID NO: 70344 15-54 gi|28916076 1-40 97% SeqID NO: 70344 8-35 gi|159473 34-61 100% Seq ID NO: 70344 8-35 gi|2454547227-200 100% Seq ID NO: 70344 8-35 gi|551594 194-221 100% Seq ID NO:70344 8-35 gi|551595 669-696 100% Seq ID NO: 70344 8-35 gi|1803225434-61 100% Seq ID NO: 70344 12-35 gi|18477256 279-256 100% Seq ID NO:70344 12-35 gi|18477260 325-302 100% Seq ID NO: 70344 12-35 gi|18477262634-611 100% Seq ID NO: 70344 8-29 gi|18477259 22-1 100% Seq ID NO:70344 8-29 gi|18477261 22-1 100% Seq ID NO: 70344 8-29 gi|37780968 22-1100% Seq ID NO: 70357 1-28 gi|33139488 28-1 100% Seq ID NO: 70376160-259 gi|18090001 348-447 89% Seq ID NO: 46707 552-763 gi|1808968157-268 85% Seq ID NO: 46707 463-495 gi|18089681 21-53 96% Seq ID NO:70457 14-41 gi|159473 34-61 100% Seq ID NO: 70457 14-41 gi|2454547227-200 100% Seq ID NO: 70457 14-41 gi|551594 194-221 100% Seq ID NO:70457 14-41 gi|551595 669-696 100% Seq ID NO: 70457 14-41 gi|1803225434-61 100% Seq ID NO: 70457 18-41 gi|18477256 279-256 100% Seq ID NO:70457 18-41 gi|18477260 325-302 100% Seq ID NO: 70457 18-41 gi|18477262634-611 100% Seq ID NO: 70457 14-35 gi|18477259 22-1 100% Seq ID NO:70457 14-35 gi|18477261 22-1 100% Seq ID NO: 70457 14-35 gi|3778096822-1 100% Seq ID NO: 70484 210-232 gi|18382755 877-899 100% Seq ID NO:70513 141-344 gi|33139742 77-280 98% Seq ID NO: 70513 515-654gi|33139742 368-507 95% Seq ID NO: 70513 384-471 gi|33139742 280-367 95%Seq ID NO: 70513 701-771 gi|33139742 510-579 97% Seq ID NO: 70513 25-97gi|33139742 5-77 94% Seq ID NO: 70575 36-174 gi|28916076 140-1 91% SeqID NO: 70575 156-181 gi|159473 59-34 100% Seq ID NO: 70575 156-181gi|2454547 202-227 100% Seq ID NO: 70575 156-181 gi|551594 219-194 100%Seq ID NO: 70575 156-181 gi|551595 694-669 100% Seq ID NO: 70575 156-181gi|18032254 59-34 100% Seq ID NO: 70575 160-181 gi|18477259 1-22 100%Seq ID NO: 70575 156-177 gi|18477257 260-281 100% Seq ID NO: 70575156-177 gi|18477260 304-325 100% Seq ID NO: 70575 160-181 gi|184772611-22 100% Seq ID NO: 70575 156-177 gi|18477262 613-634 100% Seq ID NO:70575 160-181 gi|37780968 1-22 100% Seq ID NO: 70581 1-143 gi|1679783062-204 98% Seq ID NO: 70581 1-143 gi|16797831 62-202 97% Seq ID NO:70581 1-143 gi|16797832 63-203 97% Seq ID NO: 70581 1-143 gi|2600075962-202 96% Seq ID NO: 70581 1-142 gi|26000762 62-201 95% Seq ID NO:70581 1-143 gi|31442322 62-202 95% Seq ID NO: 70581 1-143 gi|3809613362-202 93% Seq ID NO: 70595 116-138 gi|27925886 275-297 100% Seq ID NO:70736 24-50 gi|32324959 70-44 100% Seq ID NO: 70765 327-392 gi|18082500283-348 90% Seq ID NO: 70765 4-65 gi|18082500 9-69 87% Seq ID NO: 708081-78 gi|33140673 126-49 91% Seq ID NO: 70808 1-110 gi|33139639 311-42084% Seq ID NO: 70808 19-78 gi|33139131 185-244 86% Seq ID NO: 708151-122 gi|28916076 122-1 85% Seq ID NO: 70815 106-129 gi|159473 57-34100% Seq ID NO: 70815 106-129 gi|2454547 204-227 100% Seq ID NO: 70815106-129 gi|551594 217-194 100% Seq ID NO: 70815 106-129 gi|551595692-669 100% Seq ID NO: 70815 106-129 gi|18032254 57-34 100% Seq ID NO:70815 108-129 gi|18477259 1-22 100% Seq ID NO: 70815 108-129 gi|184772611-22 100% Seq ID NO: 70815 108-129 gi|37780968 1-22 100% Seq ID NO:70833 21-138 gi|28916076 1-119 91% Seq ID NO: 70833 14-41 gi|15947334-61 100% Seq ID NO: 70833 14-41 gi|2454547 227-200 100% Seq ID NO:70833 14-41 gi|551594 194-221 100% Seq ID NO: 70833 14-41 gi|551595669-696 100% Seq ID NO: 70833 14-41 gi|18032254 34-61 100% Seq ID NO:70833 18-41 gi|18477256 279-256 100% Seq ID NO: 70833 18-41 gi|18477260325-302 100% Seq ID NO: 70833 18-41 gi|18477262 634-611 100% Seq ID NO:70833 14-35 gi|18477259 22-1 100% Seq ID NO: 70833 14-35 gi|1847726122-1 100% Seq ID NO: 70833 14-35 gi|37780968 22-1 100% Seq ID NO: 708381-119 gi|30169344 49-167 99% Seq ID NO: 70968 1-78 gi|33140331 512-59290% Seq ID NO: 70972 21-138 gi|28916076 1-119 91% Seq ID NO: 70972 14-41gi|159473 34-61 100% Seq ID NO: 70972 14-41 gi|2454547 227-200 100% SeqID NO: 70972 14-41 gi|551594 194-221 100% Seq ID NO: 70972 14-41gi|551595 669-696 100% Seq ID NO: 70972 14-41 gi|18032254 34-61 100% SeqID NO: 70972 18-41 gi|18477256 279-256 100% Seq ID NO: 70972 18-41gi|18477260 325-302 100% Seq ID NO: 70972 18-41 gi|18477262 634-611 100%Seq ID NO: 70972 14-35 gi|18477259 22-1 100% Seq ID NO: 70972 14-35gi|18477261 22-1 100% Seq ID NO: 70972 14-35 gi|37780968 22-1 100% SeqID NO: 70987 1-88 gi|32324011 417-505 94% Seq ID NO: 70987 137-175gi|32324011 506-544 100% Seq ID NO: 46732 61-204 gi|18081136 58-201 82%Seq ID NO: 71126 363-383 gi|19103835 404-424 100% Seq ID NO: 71130 1-173gi|33140136 324-496 88% Seq ID NO: 71131 1-173 gi|33140136 324-496 88%Seq ID NO: 71227 2-327 gi|16797830 368-43 98% Seq ID NO: 71227 2-327gi|47118285 378-55 96% Seq ID NO: 71227 2-327 gi|47118286 390-67 96% SeqID NO: 71227 2-326 gi|31442320 365-44 96% Seq ID NO: 71227 2-326gi|16797832 367-45 95% Seq ID NO: 71227 2-327 gi|21885260 388-67 95% SeqID NO: 71227 2-327 gi|31442322 366-43 95% Seq ID NO: 71227 2-327gi|38096133 366-43 92% Seq ID NO: 71227 18-165 gi|31442314 354-207 86%Seq ID NO: 71227 18-127 gi|38096139 352-244 88% Seq ID NO: 71227 18-116gi|31442318 356-259 89% Seq ID NO: 71227 18-80 gi|16797825 322-260 95%Seq ID NO: 71227 18-80 gi|31442305 344-282 95% Seq ID NO: 71227 18-77gi|16797844 348-289 95% Seq ID NO: 71227 20-100 gi|28627583 300-220 89%Seq ID NO: 71227 18-127 gi|31442313 354-246 85% Seq ID NO: 71249 1-138gi|33140698 375-512 97% Seq ID NO: 71249 147-190 gi|33140698 521-564 97%Seq ID NO: 71288 1-134 gi|16797830 935-802 97% Seq ID NO: 71288 1-134gi|26000759 931-798 97% Seq ID NO: 71288 1-134 gi|31442322 932-799 97%Seq ID NO: 71288 1-134 gi|16797832 933-800 96% Seq ID NO: 71288 1-134gi|47118286 957-824 96% Seq ID NO: 71288 1-122 gi|26000763 930-809 97%Seq ID NO: 71288 1-133 gi|16797831 933-801 95% Seq ID NO: 71288 1-134gi|38096133 934-801 94% Seq ID NO: 71288 51-75 gi|16797844 886-862 100%Seq ID NO: 71288 51-72 gi|16797845 888-867 100% Seq ID NO: 46750 397-420gi|52127553 72-95 100% Seq ID NO: 46750 12-32 gi|403087 324-344 100% SeqID NO: 46750 12-32 gi|37517247 324-344 100% Seq ID NO: 46750 12-32gi|54548553 9-29 100% Seq ID NO: 71318 153-177 gi|54547857 370-345 96%Seq ID NO: 71359 1-93 gi|32324127 486-578 98% Seq ID NO: 71359 140-211gi|18089994 172-243 88% Seq ID NO: 71371 21-138 gi|28916076 1-119 91%Seq ID NO: 71371 14-41 gi|159473 34-61 100% Seq ID NO: 71371 14-41gi|2454547 227-200 100% Seq ID NO: 71371 14-41 gi|551594 194-221 100%Seq ID NO: 71371 14-41 gi|551595 669-696 100% Seq ID NO: 71371 14-41gi|18032254 34-61 100% Seq ID NO: 71371 18-41 gi|18477256 279-256 100%Seq ID NO: 71371 18-41 gi|18477260 325-302 100% Seq ID NO: 71371 18-41gi|18477262 634-611 100% Seq ID NO: 71371 14-35 gi|18477259 22-1 100%Seq ID NO: 71371 14-35 gi|18477261 22-1 100% Seq ID NO: 71371 14-35gi|37780968 22-1 100% Seq ID NO: 71428 4-75 gi|33139530 421-350 100% SeqID NO: 71428 4-75 gi|18080188 489-418 98% Seq ID NO: 71428 4-71gi|54545749 478-411 97% Seq ID NO: 71428 25-75 gi|54545112 474-423 96%Seq ID NO: 71446 15-54 gi|28916076 1-40 97% Seq ID NO: 71446 8-35gi|159473 34-61 100% Seq ID NO: 71446 8-35 gi|2454547 227-200 100% SeqID NO: 71446 8-35 gi|551594 194-221 100% Seq ID NO: 71446 8-35 gi|551595669-696 100% Seq ID NO: 71446 8-35 gi|18032254 34-61 100% Seq ID NO:71446 12-35 gi|18477256 279-256 100% Seq ID NO: 71446 12-35 gi|18477260325-302 100% Seq ID NO: 71446 12-35 gi|18477262 634-611 100% Seq ID NO:71446 8-29 gi|18477259 22-1 100% Seq ID NO: 71446 8-29 gi|18477261 22-1100% Seq ID NO: 71446 8-29 gi|37780968 22-1 100% Seq ID NO: 71587297-344 gi|33139758 490-537 93% Seq ID NO: 71587 300-344 gi|33140062109-61 89% Seq ID NO: 71599 1-184 gi|33139461 183-366 99% Seq ID NO:71682 1-47 gi|33139445 16-62 91% Seq ID NO: 71689 86-112 gi|323242231-27 100% Seq ID NO: 71734 1-156 gi|7144157 23-179 93% Seq ID NO: 717341-156 gi|54549688 10-166 92% Seq ID NO: 71734 38-156 gi|18083086 614-49591% Seq ID NO: 71734 41-156 gi|18090403 1-117 91% Seq ID NO: 71734 1-91gi|52129301 409-319 85% Seq ID NO: 71797 19-95 gi|18081000 336-260 92%Seq ID NO: 71818 1-136 gi|16797830 686-821 99% Seq ID NO: 71818 1-136gi|16797832 684-819 99% Seq ID NO: 71818 1-136 gi|31442320 683-818 99%Seq ID NO: 71818 1-136 gi|26000759 682-817 98% Seq ID NO: 71818 1-136gi|31442322 683-818 98% Seq ID NO: 71818 1-136 gi|38096133 685-820 97%Seq ID NO: 71818 1-80 gi|16797841 691-769 97% Seq ID NO: 71818 1-80gi|16797843 725-803 97% Seq ID NO: 71818 1-98 gi|16797846 702-801 93%Seq ID NO: 71818 1-98 gi|16797847 692-791 93% Seq ID NO: 71818 1-98gi|16797848 691-790 93% Seq ID NO: 71818 1-80 gi|16797849 692-770 97%Seq ID NO: 71818 1-98 gi|14600264 704-803 93% Seq ID NO: 71848 46-108gi|28916076 140-79 92% Seq ID NO: 71985 33-55 gi|32325183 375-397 100%Seq ID NO: 71996 1-141 gi|16797830 62-202 98% Seq ID NO: 71996 1-141gi|16797831 62-200 97% Seq ID NO: 71996 1-141 gi|16797832 63-201 97% SeqID NO: 71996 1-141 gi|26000759 62-200 96% Seq ID NO: 71996 1-141gi|31442322 62-200 95% Seq ID NO: 71996 1-141 gi|38096133 62-200 93% SeqID NO: 72109 318-338 gi|24467850 41-21 100% Seq ID NO: 72212 1-47gi|51093880 468-514 95% Seq ID NO: 72212 1-39 gi|51093879 469-507 97%Seq ID NO: 72212 1-38 gi|51093883 469-506 97% Seq ID NO: 72222 1-181gi|51093884 193-12 95% Seq ID NO: 72222 1-181 gi|18080808 236-55 93% SeqID NO: 72222 129-180 gi|18032252 2942-2891 92% Seq ID NO: 72222 130-180gi|30049837 79-29 90% Seq ID NO: 72222 147-180 gi|31326537 104-71 97%Seq ID NO: 72222 130-180 gi|20064115 417-367 88% Seq ID NO: 72222130-179 gi|19267510 50-1 88% Seq ID NO: 72222 130-169 gi|19268366 44-590% Seq ID NO: 72288 5-86 gi|54546009 456-374 87% Seq ID NO: 72350 1-57gi|35505078 6-62 96% Seq ID NO: 72350 21-57 gi|35504499 1-37 100% Seq IDNO: 72377 36-174 gi|28916076 140-1 92% Seq ID NO: 72377 154-181gi|159473 61-34 100% Seq ID NO: 72377 154-181 gi|2454547 200-227 100%Seq ID NO: 72377 154-181 gi|551594 221-194 100% Seq ID NO: 72377 154-181gi|551595 696-669 100% Seq ID NO: 72377 154-181 gi|18032254 61-34 100%Seq ID NO: 72377 154-177 gi|18477256 256-279 100% Seq ID NO: 72377154-177 gi|18477260 302-325 100% Seq ID NO: 72377 154-177 gi|18477262611-634 100% Seq ID NO: 72377 160-181 gi|18477259 1-22 100% Seq ID NO:72377 160-181 gi|18477261 1-22 100% Seq ID NO: 72377 160-181 gi|377809681-22 100% Seq ID NO: 72416 345-500 gi|18089572 129-284 82% Seq ID NO:72426 15-54 gi|28916076 1-40 97% Seq ID NO: 72426 8-35 gi|159473 34-61100% Seq ID NO: 72426 8-35 gi|2454547 227-200 100% Seq ID NO: 72426 8-35gi|551594 194-221 100% Seq ID NO: 72426 8-35 gi|551595 669-696 100% SeqID NO: 72426 8-35 gi|18032254 34-61 100% Seq ID NO: 72426 12-35gi|18477256 279-256 100% Seq ID NO: 72426 12-35 gi|18477260 325-302 100%Seq ID NO: 72426 12-35 gi|18477262 634-611 100% Seq ID NO: 72426 8-29gi|18477259 22-1 100% Seq ID NO: 72426 8-29 gi|18477261 22-1 100% Seq IDNO: 72426 8-29 gi|37780968 22-1 100% Seq ID NO: 72433 106-352gi|33139639 245-488 87% Seq ID NO: 72433 99-257 gi|33140673 199-44 85%Seq ID NO: 72529 625-646 gi|27926103 451-472 100% Seq ID NO: 7254477-272 gi|33139337 247-441 97% Seq ID NO: 72544 77-245 gi|33139239401-569 98% Seq ID NO: 72579 564-667 gi|18082454 313-416 89% Seq ID NO:46809 356-378 gi|17991863 197-219 100% Seq ID NO: 72621 15-54gi|28916076 1-40 97% Seq ID NO: 72621 8-35 gi|159473 34-61 100% Seq IDNO: 72621 8-35 gi|2454547 227-200 100% Seq ID NO: 72621 8-35 gi|551594194-221 100% Seq ID NO: 72621 8-35 gi|551595 669-696 100% Seq ID NO:72621 8-35 gi|18032254 34-61 100% Seq ID NO: 72621 12-35 gi|18477256279-256 100% Seq ID NO: 72621 12-35 gi|18477260 325-302 100% Seq ID NO:72621 12-35 gi|18477262 634-611 100% Seq ID NO: 72621 8-29 gi|1847725922-1 100% Seq ID NO: 72621 8-29 gi|18477261 22-1 100% Seq ID NO: 726218-29 gi|37780968 22-1 100% Seq ID NO: 72635 1-96 gi|33140348 160-255 96%Seq ID NO: 72667 361-457 gi|54547761 394-490 86% Seq ID NO: 72706 1-121gi|28916076 122-1 91% Seq ID NO: 72706 101-128 gi|159473 61-34 100% SeqID NO: 72706 101-128 gi|2454547 200-227 100% Seq ID NO: 72706 101-128gi|551594 221-194 100% Seq ID NO: 72706 101-128 gi|551595 696-669 100%Seq ID NO: 72706 101-128 gi|18032254 61-34 100% Seq ID NO: 72706 101-124gi|18477256 256-279 100% Seq ID NO: 72706 101-124 gi|18477260 302-325100% Seq ID NO: 72706 101-124 gi|18477262 611-634 100% Seq ID NO: 72706107-128 gi|18477259 1-22 100% Seq ID NO: 72706 107-128 gi|18477261 1-22100% Seq ID NO: 72706 107-128 gi|37780968 1-22 100% Seq ID NO: 7280821-138 gi|28916076 1-119 91% Seq ID NO: 72808 14-41 gi|159473 34-61 100%Seq ID NO: 72808 14-41 gi|2454547 227-200 100% Seq ID NO: 72808 14-41gi|551594 194-221 100% Seq ID NO: 72808 14-41 gi|551595 669-696 100% SeqID NO: 72808 14-41 gi|18032254 34-61 100% Seq ID NO: 72808 18-41gi|18477256 279-256 100% Seq ID NO: 72808 18-41 gi|18477260 325-302 100%Seq ID NO: 72808 18-41 gi|18477262 634-611 100% Seq ID NO: 72808 14-35gi|18477259 22-1 100% Seq ID NO: 72808 14-35 gi|18477261 22-1 100% SeqID NO: 72808 14-35 gi|37780968 22-1 100% Seq ID NO: 72820 5-100gi|33139131 115-213 86% Seq ID NO: 72859 169-270 gi|18090422 452-553 89%Seq ID NO: 73039 1-111 gi|33139007 235-125 98% Seq ID NO: 73039 119-235gi|33139007 117-1 94% Seq ID NO: 73039 132-235 gi|33140543 104-1 98% SeqID NO: 73156 73-218 gi|32324285 154-299 97% Seq ID NO: 73156 266-408gi|32324285 298-440 96% Seq ID NO: 73156 454-534 gi|32324285 439-519100% Seq ID NO: 73156 454-516 gi|33139772 439-501 100% Seq ID NO: 73170167-212 gi|32324562 537-492 91% Seq ID NO: 73219 223-246 gi|32325459267-290 100% Seq ID NO: 73233 75-95 gi|19267575 145-125 100% Seq ID NO:73426 36-174 gi|28916076 140-1 92% Seq ID NO: 73426 154-181 gi|15947361-34 100% Seq ID NO: 73426 154-181 gi|2454547 200-227 100% Seq ID NO:73426 154-181 gi|551594 221-194 100% Seq ID NO: 73426 154-181 gi|551595696-669 100% Seq ID NO: 73426 154-181 gi|18032254 61-34 100% Seq ID NO:73426 154-177 gi|18477256 256-279 100% Seq ID NO: 73426 154-177gi|18477260 302-325 100% Seq ID NO: 73426 154-177 gi|18477262 611-634100% Seq ID NO: 73426 160-181 gi|18477259 1-22 100% Seq ID NO: 73426160-181 gi|18477261 1-22 100% Seq ID NO: 73426 160-181 gi|37780968 1-22100% Seq ID NO: 73454 1-50 gi|33140348 252-203 98% Seq ID NO: 46864225-248 gi|52127553 72-95 100% Seq ID NO: 73594 79-252 gi|32324211 1-17184% Seq ID NO: 73594 73-224 gi|33139131 99-250 85% Seq ID NO: 73594294-331 gi|33139131 293-330 94% Seq ID NO: 73616 1-285 gi|32324082243-527 97% Seq ID NO: 73616 331-370 gi|32324082 527-566 100% Seq ID NO:46871 1109-1129 gi|15766531 77-57 100% Seq ID NO: 46896 1-152gi|32325576 17-168 99% Seq ID NO: 46896 1-150 gi|54547472 44-193 90% SeqID NO: 46896 1-150 gi|18083068 22-171 88% Seq ID NO: 73960 36-174gi|28916076 140-1 92% Seq ID NO: 73960 154-181 gi|159473 61-34 100% SeqID NO: 73960 154-181 gi|2454547 200-227 100% Seq ID NO: 73960 154-181gi|551594 221-194 100% Seq ID NO: 73960 154-181 gi|551595 696-669 100%Seq ID NO: 73960 154-181 gi|18032254 61-34 100% Seq ID NO: 73960 154-177gi|18477256 256-279 100% Seq ID NO: 73960 154-177 gi|18477260 302-325100% Seq ID NO: 73960 154-177 gi|18477262 611-634 100% Seq ID NO: 73960160-181 gi|18477259 1-22 100% Seq ID NO: 73960 160-181 gi|18477261 1-22100% Seq ID NO: 73960 160-181 gi|37780968 1-22 100% Seq ID NO: 739861-101 gi|33139929 341-441 99% Seq ID NO: 73986 151-236 gi|33139929439-524 100% Seq ID NO: 73986 16-101 gi|32324461 1-86 98% Seq ID NO:73987 4-146 gi|7144133 498-641 91% Seq ID NO: 73987 52-152 gi|183824141346-1446 93% Seq ID NO: 74514 17-120 gi|7143618 355-252 86% Seq ID NO:74514 155-180 gi|18080329 30-5 96% Seq ID NO: 74514 154-187 gi|18080516211-178 94% Seq ID NO: 74514 155-187 gi|7143678 478-446 93% Seq ID NO:74633 161-181 gi|38512984 421-441 100% Seq ID NO: 74832 1-56 gi|7641359475-420 92% Seq ID NO: 74832 10-53 gi|15003596 5-48 97% Seq ID NO: 748321-57 gi|14085900 55-111 91% Seq ID NO: 74832 1-53 gi|14086110 51-103 92%Seq ID NO: 74832 1-53 gi|19265574 41-93 92% Seq ID NO: 74832 1-57gi|39747527 43-99 91% Seq ID NO: 74832 1-56 gi|15767660 65-120 91% SeqID NO: 74832 1-56 gi|18495580 501-446 91% Seq ID NO: 74832 16-53gi|15003618 64-101 97% Seq ID NO: 74832 1-53 gi|34025909 42-94 90% SeqID NO: 74992 72-256 gi|33139639 245-429 92% Seq ID NO: 74992 63-214gi|33140673 201-50 89% Seq ID NO: 75035 76-97 gi|19266802 444-465 100%Seq ID NO: 75105 431-590 gi|33139939 312-471 97% Seq ID NO: 75105240-388 gi|33139939 165-312 96% Seq ID NO: 75105 799-910 gi|33139939472-583 99% Seq ID NO: 75105 64-176 gi|33139939 52-164 94% Seq ID NO:75105 829-959 gi|18090520 33-163 87% Seq ID NO: 75106 327-348gi|33140085 432-453 100% Seq ID NO: 75118 318-457 gi|14280575 2710-2849100% Seq ID NO: 75118 133-231 gi|14280575 2614-2712 100% Seq ID NO:75118 2-77 gi|14280575 2538-2613 100% Seq ID NO: 75118 318-355gi|14280570 3427-3465 92% Seq ID NO: 75173 161-181 gi|32324086 604-584100% Seq ID NO: 75190 67-108 gi|7144329 64-105 100% Seq ID NO: 75203175-233 gi|21393342 99-157 88% Seq ID NO: 75203 190-233 gi|21393558107-150 91% Seq ID NO: 75203 175-201 gi|7797714 281-307 96% Seq ID NO:75289 66-470 gi|33140136 92-496 90% Seq ID NO: 75439 292-321 gi|7922575190-218 96% Seq ID NO: 75663 438-478 gi|18081901 558-518 92% Seq ID NO:47026 543-822 gi|18382431 565-486 91% Seq ID NO: 75916 57-196gi|32323999 1-140 95% Seq ID NO: 75950 6-261 gi|32324211 1-256 84% SeqID NO: 75950 17-145 gi|33139131 116-244 89% Seq ID NO: 75950 13-250gi|33139639 256-493 81% Seq ID NO: 75950 1-145 gi|33140673 193-49 84%Seq ID NO: 75960 1-75 gi|30028941 235-309 96% Seq ID NO: 75960 1-75gi|33140204 235-309 96% Seq ID NO: 47033 313-382 gi|32324686 13-82 98%Seq ID NO: 76094 66-470 gi|33140136 92-496 89% Seq ID NO: 76242 181-203gi|27001433 261-283 100% Seq ID NO: 76281 16-139 gi|54546240 15-138 90%Seq ID NO: 76281 16-139 gi|18089390 11-134 89% Seq ID NO: 76311 128-240gi|32324311 117-229 88% Seq ID NO: 76311 1-46 gi|32324311 55-100 93% SeqID NO: 76311 130-241 gi|33140078 156-267 87% Seq ID NO: 76311 152-241gi|33139028 10-99 90% Seq ID NO: 47066 255-278 gi|7143978 211-234 100%Seq ID NO: 76358 1-57 gi|32324009 418-474 98% Seq ID NO: 76358 1-53gi|33139304 439-491 98% Seq ID NO: 76409 5-25 gi|19267937 180-160 100%Seq ID NO: 76432 620-750 gi|33139974 141-11 99% Seq ID NO: 76432 339-516gi|33139974 316-139 91% Seq ID NO: 76470 296-390 gi|33140673 141-44 85%Seq ID NO: 76471 1-87 gi|35505085 14-100 97% Seq ID NO: 76471 8-87gi|35504991 21-100 95% Seq ID NO: 76471 49-87 gi|35505046 1-39 100% SeqID NO: 76556 591-615 gi|19103651 327-352 96% Seq ID NO: 76634 17-151gi|33139131 116-250 88% Seq ID NO: 76634 6-145 gi|32324211 1-140 88% SeqID NO: 76634 13-279 gi|33139639 256-516 80% Seq ID NO: 76634 1-145gi|33140673 193-49 84% Seq ID NO: 76667 3-64 gi|32325283 20-81 96% SeqID NO: 76718 1-84 gi|33139696 197-280 94% Seq ID NO: 76775 1-149gi|35505158 357-505 99% Seq ID NO: 47105 82-256 gi|18080493 64-238 87%Seq ID NO: 47105 16-36 gi|18080493 48-68 100% Seq ID NO: 47105 82-254gi|7144379 51-223 87% Seq ID NO: 76970 61-82 gi|18090014 19-40 100% SeqID NO: 77032 570-679 gi|33139969 11-120 96% Seq ID NO: 77038 5-123gi|2707748 1563-1445 99% Seq ID NO: 77038 5-123 gi|34105813 1562-144499% Seq ID NO: 77038 5-123 gi|34105815 1562-1444 99% Seq ID NO: 7703817-123 gi|30844179 1550-1444 95% Seq ID NO: 77038 17-123 gi|341058071434-1328 95% Seq ID NO: 77038 17-123 gi|34105808 1545-1439 95% Seq IDNO: 77038 17-123 gi|34105810 1561-1455 95% Seq ID NO: 77038 17-123gi|31376322 1492-1386 94% Seq ID NO: 77038 17-123 gi|31376323 1488-138294% Seq ID NO: 77038 17-123 gi|34105806 1558-1452 93% Seq ID NO: 7703817-123 gi|51093982 386-280 93% Seq ID NO: 77038 17-123 gi|69839591511-1405 92% Seq ID NO: 77038 17-123 gi|22544385 274-168 92% Seq ID NO:77038 17-123 gi|30169951 41-147 92% Seq ID NO: 77038 17-123 gi|313763251503-1397 92% Seq ID NO: 77198 308-501 gi|33140188 263-70 96% Seq ID NO:77198 302-469 gi|33140188 464-297 87% Seq ID NO: 77198 4-91 gi|33140188353-266 97% Seq ID NO: 77198 1-88 gi|33140188 551-464 90% Seq ID NO:77198 389-469 gi|33140188 572-492 91% Seq ID NO: 77198 1-60 gi|33140188161-102 90% Seq ID NO: 77199 5-155 gi|33140188 113-263 96% Seq ID NO:77199 372-521 gi|33140188 266-415 96% Seq ID NO: 77199 403-519gi|33140188 102-218 94% Seq ID NO: 77199 5-161 gi|33140188 308-464 88%Seq ID NO: 77199 375-483 gi|33140188 464-572 91% Seq ID NO: 77199 5-74gi|33140188 503-572 91% Seq ID NO: 77213 199-266 gi|18090001 163-230 85%Seq ID NO: 77253 1-146 gi|33139770 381-526 98% Seq ID NO: 77260 4-60gi|33140397 536-592 98% Seq ID NO: 77260 4-58 gi|33139624 522-576 98%Seq ID NO: 77342 1-101 gi|7144157 39-139 97% Seq ID NO: 77342 1-101gi|54549688 26-126 96% Seq ID NO: 77342 22-101 gi|18083086 614-535 96%Seq ID NO: 77342 25-101 gi|18090403 1-77 96% Seq ID NO: 77342 54-75gi|52129301 340-319 100% Seq ID NO: 77382 7-92 gi|54546009 456-370 88%Seq ID NO: 77397 2-222 gi|7144372 296-516 90% Seq ID NO: 77397 2-214gi|18081853 341-552 90% Seq ID NO: 77478 1-43 gi|7144081 50-92 90% SeqID NO: 77496 474-494 gi|39747177 32-12 100% Seq ID NO: 77564 203-337gi|33140348 119-253 86% Seq ID NO: 77649 1-160 gi|18081259 449-291 93%Seq ID NO: 77725 1-24 gi|33139696 554-577 100% Seq ID NO: 78000 1-163gi|32324951 29-197 94% Seq ID NO: 47181 709-827 gi|18080339 13-131 88%Seq ID NO: 78139 149-236 gi|32324369 16-103 97% Seq ID NO: 78157 789-809gi|34025242 232-212 100% Seq ID NO: 78157 789-809 gi|32322566 529-509100% Seq ID NO: 47188 202-341 gi|32324409 329-468 95% Seq ID NO: 471881-90 gi|33139667 502-591 98% Seq ID NO: 78415 107-386 gi|33140127 75-35496% Seq ID NO: 78415 1-44 gi|33140127 33-76 95% Seq ID NO: 78436 8-67gi|28916076 60-1 90% Seq ID NO: 78818 18-77 gi|30028941 53-112 95% SeqID NO: 78818 18-77 gi|33140204 53-112 95% Seq ID NO: 78927 1-110gi|32325405 74-183 95% Seq ID NO: 78955 118-142 gi|18089765 337-361 100%Seq ID NO: 79093 105-368 gi|33139802 311-574 98% Seq ID NO: 79093 1-63gi|33139802 249-311 100% Seq ID NO: 79093 105-344 gi|33140772 311-55098% Seq ID NO: 79160 206-305 gi|33139131 107-206 87% Seq ID NO: 79160429-458 gi|33139131 300-328 93% Seq ID NO: 79160 291-313 gi|7143644362-384 100% Seq ID NO: 79239 183-270 gi|18083061 2-89 87% Seq ID NO:79275 94-131 gi|15767717 372-409 92% Seq ID NO: 79292 768-882gi|18082714 142-256 88% Seq ID NO: 79369 124-211 gi|32325459 290-208 84%Seq ID NO: 79371 10-86 gi|54547347 34-110 88% Seq ID NO: 79419 1-110gi|33139485 331-440 95% Seq ID NO: 79419 161-248 gi|33139485 438-525 98%Seq ID NO: 79533 1-179 gi|33140348 253-75 89% Seq ID NO: 79533 122-160gi|32324498 44-5 92% Seq ID NO: 79545 1-68 gi|7143918 49-116 91% Seq IDNO: 79545 1-42 gi|18382970 1050-1009 95% Seq ID NO: 47286 317-516gi|18088906 218-417 88% Seq ID NO: 47286 106-178 gi|18088906 140-212 91%Seq ID NO: 79576 77-189 gi|32324732 377-489 98% Seq ID NO: 79576 1-69gi|32324732 301-369 100% Seq ID NO: 79602 60-139 gi|7143654 84-5 90% SeqID NO: 79614 287-458 gi|30169344 167-338 97% Seq ID NO: 79614 1-119gi|30169344 49-167 100% Seq ID NO: 79729 1-28 gi|33140324 288-315 100%Seq ID NO: 79741 94-257 gi|33140312 390-553 96% Seq ID NO: 79741 94-145gi|33140619 501-552 94% Seq ID NO: 79818 1-75 gi|32325315 37-111 100%Seq ID NO: 79818 18-75 gi|35504416 1-58 98% Seq ID NO: 79818 32-75gi|35504544 15-58 100% Seq ID NO: 79818 36-75 gi|35504483 4-43 100% SeqID NO: 79885 1-141 gi|33139639 311-451 90% Seq ID NO: 79885 1-78gi|33140673 126-49 92% Seq ID NO: 79885 32-78 gi|32324211 94-140 95% SeqID NO: 79893 1-62 gi|32324007 469-408 95% Seq ID NO: 80017 17-50gi|7144038 422-455 100% Seq ID NO: 80102 5-88 gi|33139645 485-570 91%Seq ID NO: 80192 397-417 gi|17991359 548-528 100% Seq ID NO: 47324 1-98gi|35504434 329-426 91% Seq ID NO: 47325 416-525 gi|7143492 304-195 90%Seq ID NO: 80394 1-240 gi|33139962 21-260 96% Seq ID NO: 80394 295-405gi|33139962 264-374 97% Seq ID NO: 80394 199-239 gi|18089348 186-227 92%Seq ID NO: 80422 39-59 gi|27926823 344-364 100% Seq ID NO: 80601 50-200gi|33140348 105-255 94% Seq ID NO: 80601 8-63 gi|33140348 53-107 96% SeqID NO: 80601 50-77 gi|32324498 17-44 96% Seq ID NO: 80750 67-235gi|54546057 58-226 88% Seq ID NO: 80750 12-33 gi|33952487 540-561 100%Seq ID NO: 80750 12-33 gi|34025939 541-562 100% Seq ID NO: 80750 205-226gi|453579 1960-1939 100% Seq ID NO: 80985 458-478 gi|8005746 172-192100% Seq ID NO: 81077 19-132 gi|18080124 72-186 84% Seq ID NO: 8107735-63 gi|18082691 129-157 96% Seq ID NO: 81120 854-916 gi|32324217 4-66100% Seq ID NO: 81120 862-883 gi|7797944 63-84 100% Seq ID NO: 474001-136 gi|33140804 236-371 100% Seq ID NO: 47400 194-337 gi|33140804372-515 95% Seq ID NO: 81337 1-154 gi|32324094 298-451 97% Seq ID NO:81337 2-135 gi|18381585 1162-1029 87% Seq ID NO: 81377 309-329gi|30165848 173-193 100% Seq ID NO: 81381 1-110 gi|33139131 114-225 87%Seq ID NO: 81412 136-188 gi|32324211 450-502 90% Seq ID NO: 81471 50-242gi|18088614 91-283 87% Seq ID NO: 81471 589-666 gi|18088614 289-366 93%Seq ID NO: 81471 606-666 gi|32183705 270-330 88% Seq ID NO: 81477 17-237gi|18082509 367-587 90% Seq ID NO: 81477 17-237 gi|54548304 360-580 90%Seq ID NO: 81477 17-231 gi|54548230 373-587 88% Seq ID NO: 81624 114-264gi|33139681 415-565 98% Seq ID NO: 81624 1-72 gi|33139681 346-417 98%Seq ID NO: 81634 1-153 gi|7110849 31-183 91% Seq ID NO: 81634 1-162gi|54546003 335-496 87% Seq ID NO: 81648 15-54 gi|28916076 1-40 97% SeqID NO: 81648 8-35 gi|159473 34-61 100% Seq ID NO: 81648 8-35 gi|2454547227-200 100% Seq ID NO: 81648 8-35 gi|551594 194-221 100% Seq ID NO:81648 8-35 gi|551595 669-696 100% Seq ID NO: 81648 8-35 gi|1803225434-61 100% Seq ID NO: 81648 12-35 gi|18477256 279-256 100% Seq ID NO:81648 12-35 gi|18477260 325-302 100% Seq ID NO: 81648 12-35 gi|18477262634-611 100% Seq ID NO: 81648 8-29 gi|18477259 22-1 100% Seq ID NO:81648 8-29 gi|18477261 22-1 100% Seq ID NO: 81648 8-29 gi|37780968 22-1100% Seq ID NO: 81702 17-91 gi|33139131 116-190 92% Seq ID NO: 817026-97 gi|32324211 1-92 87% Seq ID NO: 81703 84-347 gi|33139639 256-51684% Seq ID NO: 81703 77-216 gi|32324211 1-140 89% Seq ID NO: 8170371-222 gi|33139131 99-250 87% Seq ID NO: 81703 66-216 gi|33140673 199-4986% Seq ID NO: 81813 589-938 gi|33139425 112-463 94% Seq ID NO: 81813429-539 gi|33139425 1-111 97% Seq ID NO: 81813 664-822 gi|32323963132-290 98% Seq ID NO: 81813 877-972 gi|32323963 290-385 98% Seq ID NO:81813 877-971 gi|32325615 290-384 98% Seq ID NO: 81813 427-539gi|33139274 1-113 97% Seq ID NO: 81813 429-606 gi|33140556 1-181 92% SeqID NO: 81813 666-822 gi|32324994 135-291 98% Seq ID NO: 81813 428-539gi|32324994 1-112 99% Seq ID NO: 81813 877-947 gi|33139381 232-302 98%Seq ID NO: 81813 487-539 gi|33139381 1-53 100% Seq ID NO: 81813 427-523gi|33140583 1-97 96% Seq ID NO: 81817 452-473 gi|33952321 139-160 100%Seq ID NO: 81839 16-77 gi|7143651 36-97 90% Seq ID NO: 81895 515-875gi|32325036 211-571 95% Seq ID NO: 81895 472-827 gi|32325036 216-571 94%Seq ID NO: 81895 563-897 gi|32325036 211-545 94% Seq ID NO: 81895514-693 gi|32325036 18-197 93% Seq ID NO: 81895 482-645 gi|3232503634-197 94% Seq ID NO: 81895 610-789 gi|32325036 18-197 92% Seq ID NO:81895 562-741 gi|32325036 18-197 92% Seq ID NO: 81895 658-837gi|32325036 18-197 92% Seq ID NO: 81895 706-885 gi|32325036 18-197 92%Seq ID NO: 81895 754-897 gi|32325036 18-161 93% Seq ID NO: 81895 472-597gi|32325036 72-197 93% Seq ID NO: 81895 48-246 gi|32325036 231-428 86%Seq ID NO: 81895 1-246 gi|32325036 232-476 83% Seq ID NO: 81895 137-270gi|32325036 224-356 87% Seq ID NO: 81895 48-207 gi|32325036 39-197 83%Seq ID NO: 81895 138-253 gi|32325036 33-147 86% Seq ID NO: 81895 1-157gi|32325036 40-196 80% Seq ID NO: 81895 470-716 gi|33140561 167-413 96%Seq ID NO: 81895 520-764 gi|33140561 169-413 95% Seq ID NO: 81895568-812 gi|33140561 169-413 95% Seq ID NO: 81895 616-860 gi|33140561169-413 95% Seq ID NO: 81895 664-897 gi|33140561 169-402 95% Seq ID NO:81895 457-668 gi|33140561 202-413 95% Seq ID NO: 81895 1-230 gi|33140561185-413 83% Seq ID NO: 81895 691-712 gi|33140561 412-433 100% Seq ID NO:81895 835-856 gi|33140561 412-433 100% Seq ID NO: 81895 547-568gi|33140561 412-433 100% Seq ID NO: 81895 787-808 gi|33140561 412-433100% Seq ID NO: 81895 643-664 gi|33140561 412-433 100% Seq ID NO: 81895739-760 gi|33140561 412-433 100% Seq ID NO: 81895 595-616 gi|33140561412-433 100% Seq ID NO: 81895 499-520 gi|33140561 412-433 100% Seq IDNO: 81895 534-896 gi|7143587 175-537 83% Seq ID NO: 81895 486-848gi|7143587 175-537 83% Seq ID NO: 81895 484-752 gi|7143944 428-699 82%Seq ID NO: 81895 724-896 gi|7143944 332-504 84% Seq ID NO: 81895 630-728gi|7143944 133-231 88% Seq ID NO: 81895 582-671 gi|7143944 133-222 89%Seq ID NO: 81895 534-623 gi|7143944 133-222 89% Seq ID NO: 81895 774-863gi|7143944 133-222 89% Seq ID NO: 81895 486-575 gi|7143944 133-222 89%Seq ID NO: 81895 822-896 gi|7143944 133-207 90% Seq ID NO: 81895 678-767gi|7143944 133-222 86% Seq ID NO: 81895 470-569 gi|33139222 297-396 97%Seq ID NO: 81895 520-617 gi|33139222 299-396 95% Seq ID NO: 81895760-857 gi|33139222 299-396 95% Seq ID NO: 81895 616-713 gi|33139222299-396 95% Seq ID NO: 81895 568-665 gi|33139222 299-396 95% Seq ID NO:81895 712-809 gi|33139222 299-396 93% Seq ID NO: 81895 664-761gi|33139222 299-396 93% Seq ID NO: 81895 808-897 gi|33139222 299-388 95%Seq ID NO: 81895 457-521 gi|33139222 332-396 92% Seq ID NO: 82088 73-93gi|33140421 13-33 100% Seq ID NO: 82094 18-129 gi|33139131 139-250 90%Seq ID NO: 82094 31-123 gi|32324211 48-140 90% Seq ID NO: 82094 1-152gi|33139639 266-417 80% Seq ID NO: 82109 19-196 gi|18087933 464-287 90%Seq ID NO: 82109 19-184 gi|54547517 403-568 88% Seq ID NO: 82109 19-166gi|18081843 491-638 89% Seq ID NO: 82109 19-139 gi|18083082 505-625 90%Seq ID NO: 82109 66-191 gi|51334233 404-529 81% Seq ID NO: 82142 1-79gi|33140136 402-480 93% Seq ID NO: 82163 122-267 gi|33139131 99-244 87%Seq ID NO: 82163 128-267 gi|32324211 1-140 87% Seq ID NO: 82166 2-106gi|33139639 245-349 90% Seq ID NO: 82166 1-120 gi|33140673 193-74 84%Seq ID NO: 82199 66-470 gi|33140136 92-496 89% Seq ID NO: 82221 1-87gi|18090662 244-330 88% Seq ID NO: 82256 59-122 gi|14280572 6490-655392% Seq ID NO: 82260 698-718 gi|37972243 197-217 100% Seq ID NO: 82260698-718 gi|45566049 213-233 100% Seq ID NO: 47476 3-211 gi|18089811280-488 87% Seq ID NO: 47476 3-167 gi|18080518 317-481 88% Seq ID NO:82506 1-174 gi|33140348 248-75 89% Seq ID NO: 82506 117-155 gi|3232449844-5 92% Seq ID NO: 82529 1-86 gi|32324692 54-139 97% Seq ID NO: 825297-86 gi|18082298 52-131 90% Seq ID NO: 82799 1-40 gi|33140348 254-215100% Seq ID NO: 82799 78-135 gi|33140348 181-123 90% Seq ID NO: 82849599-663 gi|18080145 213-277 87% Seq ID NO: 82849 358-380 gi|40670113439-417 100% Seq ID NO: 82865 1-127 gi|33140348 255-128 93% Seq ID NO:82865 185-227 gi|33140348 70-27 97% Seq ID NO: 82929 772-793 gi|893075949-28 100% Seq ID NO: 82929 772-793 gi|34026216 41-20 100% Seq ID NO:82965 302-322 gi|46984484 89-109 100% Seq ID NO: 83225 316-338gi|27540675 36-58 100% Seq ID NO: 83226 299-358 gi|32325383 584-525 98%Seq ID NO: 83249 3-237 gi|33139581 1-235 99% Seq ID NO: 83274 1-55gi|21393574 296-350 89% Seq ID NO: 83339 1-268 gi|51093880 722-452 86%Seq ID NO: 83339 1-252 gi|51093883 723-469 85% Seq ID NO: 83339 134-268gi|51093882 587-452 87% Seq ID NO: 83339 134-252 gi|54545658 621-503 87%Seq ID NO: 83379 40-139 gi|18081804 201-300 89% Seq ID NO: 83379 40-130gi|18088266 516-606 90% Seq ID NO: 83382 42-429 gi|32325123 663-276 87%Seq ID NO: 83382 132-519 gi|32325123 663-276 87% Seq ID NO: 83382 1-384gi|32325123 659-276 87% Seq ID NO: 83382 90-474 gi|32325123 660-276 86%Seq ID NO: 83382 3-294 gi|32325123 567-276 89% Seq ID NO: 83382 177-526gi|32325123 663-314 86% Seq ID NO: 83382 1-204 gi|32325123 479-276 90%Seq ID NO: 83382 312-530 gi|32325123 663-445 84% Seq ID NO: 83382 1-87gi|32325123 344-258 94% Seq ID NO: 83382 17-165 gi|32325123 236-84 83%Seq ID NO: 83382 308-435 gi|32325123 214-84 83% Seq ID NO: 83382 332-384gi|32325123 236-183 90% Seq ID NO: 83382 377-429 gi|32325123 236-183 90%Seq ID NO: 83382 219-345 gi|32325123 213-84 81% Seq ID NO: 83382 489-519gi|32325123 213-183 100% Seq ID NO: 83382 129-159 gi|32325123 213-183100% Seq ID NO: 83382 17-75 gi|32325123 143-84 88% Seq ID NO: 83382242-294 gi|32325123 236-183 88% Seq ID NO: 83382 197-255 gi|32325123143-84 87% Seq ID NO: 83382 467-525 gi|32325123 143-84 87% Seq ID NO:83382 1-24 gi|32325123 206-183 100% Seq ID NO: 83382 9-30 gi|32325123105-84 100% Seq ID NO: 83382 129-519 gi|33139983 660-271 87% Seq ID NO:83382 38-519 gi|33139983 661-178 84% Seq ID NO: 83382 78-474 gi|33139983666-271 86% Seq ID NO: 83382 173-526 gi|33139983 661-309 87% Seq ID NO:83382 308-530 gi|33139983 661-440 87% Seq ID NO: 83382 351-519gi|33139983 663-495 88% Seq ID NO: 83382 1-337 gi|33139983 429-87 82%Seq ID NO: 83382 129-513 gi|33139225 635-252 87% Seq ID NO: 83382 1-378gi|33139225 628-252 87% Seq ID NO: 83382 38-423 gi|33139225 636-252 86%Seq ID NO: 83382 82-464 gi|33139225 637-256 86% Seq ID NO: 83382 352-519gi|33139225 637-470 88% Seq ID NO: 83382 3-159 gi|33139225 312-153 82%Seq ID NO: 83382 28-384 gi|32324170 684-328 87% Seq ID NO: 83382 129-474gi|32324170 673-328 87% Seq ID NO: 83382 163-519 gi|32324170 684-328 87%Seq ID NO: 83382 81-519 gi|32324170 676-235 84% Seq ID NO: 83382 219-528gi|32324170 673-364 88% Seq ID NO: 83382 3-439 gi|32324170 574-132 83%Seq ID NO: 83382 253-526 gi|32324170 684-411 87% Seq ID NO: 83382 1-357gi|32324170 486-124 82% Seq ID NO: 83382 350-530 gi|32324170 677-497 87%Seq ID NO: 83382 1-78 gi|32324170 666-589 96% Seq ID NO: 83382 433-528gi|32324170 684-589 90% Seq ID NO: 83382 222-519 gi|33140346 602-302 86%Seq ID NO: 83382 132-429 gi|33140346 602-302 86% Seq ID NO: 83382 42-428gi|33140346 602-210 83% Seq ID NO: 83382 1-338 gi|33140346 553-210 83%Seq ID NO: 83382 357-526 gi|33140346 602-433 90% Seq ID NO: 83382 17-87gi|33140346 168-97 91% Seq ID NO: 83382 377-439 gi|33140346 168-105 90%Seq ID NO: 83382 492-528 gi|33140346 602-566 100% Seq ID NO: 83382467-529 gi|33140346 168-105 89% Seq ID NO: 83382 332-394 gi|33140346168-105 89% Seq ID NO: 83382 107-169 gi|33140346 168-105 89% Seq ID NO:83382 197-259 gi|33140346 168-105 89% Seq ID NO: 83382 377-434gi|33140346 75-17 90% Seq ID NO: 83382 17-74 gi|33140346 75-17 90% SeqID NO: 83382 1-114 gi|33140346 418-302 82% Seq ID NO: 83382 1-34gi|33140346 138-105 97% Seq ID NO: 83382 332-389 gi|33140346 75-17 88%Seq ID NO: 83382 107-164 gi|33140346 75-17 88% Seq ID NO: 83382 467-524gi|33140346 75-17 88% Seq ID NO: 83382 197-254 gi|33140346 75-17 88% SeqID NO: 83382 1-29 gi|33140346 45-17 96% Seq ID NO: 83382 197-384gi|33140426 442-254 88% Seq ID NO: 83382 332-519 gi|33140426 442-254 88%Seq ID NO: 83382 287-474 gi|33140426 442-254 87% Seq ID NO: 83382242-429 gi|33140426 442-254 88% Seq ID NO: 83382 107-294 gi|33140426442-254 87% Seq ID NO: 83382 1-159 gi|33140426 412-254 89% Seq ID NO:83382 17-204 gi|33140426 442-254 86% Seq ID NO: 83382 377-528gi|33140426 442-290 88% Seq ID NO: 83382 62-249 gi|33140426 442-254 86%Seq ID NO: 83382 3-87 gi|33140426 320-236 91% Seq ID NO: 83382 28-169gi|32325556 222-78 83% Seq ID NO: 83382 298-439 gi|32325556 222-78 82%Seq ID NO: 83382 208-357 gi|32325556 222-70 80% Seq ID NO: 83382 478-519gi|32325556 222-181 92% Seq ID NO: 83382 17-153 gi|32324836 149-10 83%Seq ID NO: 83382 377-513 gi|32324836 149-10 82% Seq ID NO: 83382 197-329gi|32324836 149-14 81% Seq ID NO: 83382 107-243 gi|32324836 149-10 81%Seq ID NO: 83382 467-519 gi|32324836 149-97 90% Seq ID NO: 83382 1-104gi|32324836 120-14 81% Seq ID NO: 83382 38-161 gi|33139487 219-93 83%Seq ID NO: 83382 350-384 gi|33139487 222-188 97% Seq ID NO: 83382219-339 gi|33139487 218-95 81% Seq ID NO: 83382 398-429 gi|33139487219-188 96% Seq ID NO: 83382 43-153 gi|33139115 124-11 83% Seq ID NO:83382 133-243 gi|33139115 124-11 82% Seq ID NO: 83382 403-513gi|33139115 124-11 82% Seq ID NO: 83382 223-329 gi|33139115 124-15 82%Seq ID NO: 83382 493-519 gi|33139115 124-98 100% Seq ID NO: 83382358-384 gi|33139115 124-98 100% Seq ID NO: 83407 61-104 gi|5454644655-98 90% Seq ID NO: 83502 3-97 gi|18090256 296-390 90% Seq ID NO: 835311-55 gi|33140348 201-255 98% Seq ID NO: 83556 31-110 gi|54547037 1-8086% Seq ID NO: 83556 32-110 gi|54547033 1-79 86% Seq ID NO: 83658281-303 gi|33138709 165-143 100% Seq ID NO: 83658 337-357 gi|28916076280-260 100% Seq ID NO: 83678 289-309 gi|37853736 373-393 100% Seq IDNO: 83731 1-134 gi|32324631 49-182 94% Seq ID NO: 83731 1-107gi|33139363 49-152 92% Seq ID NO: 83737 18-39 gi|31326189 232-211 100%Seq ID NO: 83765 226-357 gi|7144169 364-495 87% Seq ID NO: 83765 274-335gi|18081294 455-516 91% Seq ID NO: 83765 274-358 gi|18089822 443-527 91%Seq ID NO: 83781 408-429 gi|17971089 242-221 100% Seq ID NO: 83828539-605 gi|33139668 557-491 92% Seq ID NO: 83828 386-419 gi|33139668530-497 94% Seq ID NO: 83828 347-368 gi|33139668 530-509 100% Seq ID NO:83828 35-56 gi|33139668 530-509 100% Seq ID NO: 83828 230-251gi|33139668 530-509 100% Seq ID NO: 83828 566-605 gi|33139848 530-49197% Seq ID NO: 83855 27-51 gi|33140021 1-25 100% Seq ID NO: 47578 1-101gi|18382566 65-165 88% Seq ID NO: 47578 7-101 gi|18089815 2-96 88% SeqID NO: 47578 64-98 gi|21493534 40-74 91% Seq ID NO: 84076 75-164gi|18081616 391-302 87% Seq ID NO: 84205 295-402 gi|18079959 358-465 82%Seq ID NO: 84241 159-179 gi|52127565 269-249 100% Seq ID NO: 84390 75-97gi|34028088 28-50 100% Seq ID NO: 84419 109-281 gi|33139639 255-428 88%Seq ID NO: 84419 96-242 gi|33140673 195-49 89% Seq ID NO: 84522 1-73gi|33140257 196-268 98% Seq ID NO: 84522 44-73 gi|32324393 1-30 100% SeqID NO: 84576 73-112 gi|21393592 395-434 90% Seq ID NO: 84645 275-323gi|7143520 433-385 91% Seq ID NO: 84648 6-28 gi|35504845 119-97 100% SeqID NO: 84664 125-168 gi|33139131 293-336 93% Seq ID NO: 84696 527-650gi|35504896 483-606 96% Seq ID NO: 84696 1-127 gi|35504896 213-339 96%Seq ID NO: 84696 189-267 gi|35504896 337-415 97% Seq ID NO: 84696473-526 gi|35504896 413-466 98% Seq ID NO: 84696 40-98 gi|7143674296-354 93% Seq ID NO: 84696 42-98 gi|7144456 296-352 92% Seq ID NO:84696 40-92 gi|18382222 454-506 94% Seq ID NO: 84696 40-89 gi|18090122491-540 94% Seq ID NO: 84696 55-92 gi|18381283 389-426 94% Seq ID NO:84779 145-167 gi|46985397 123-101 100% Seq ID NO: 84814 20-58gi|28916076 1-41 92% Seq ID NO: 47631 473-497 gi|54549558 143-167 100%Seq ID NO: 84950 15-54 gi|28916076 1-40 97% Seq ID NO: 84950 8-35gi|159473 34-61 100% Seq ID NO: 84950 8-35 gi|2454547 227-200 100% SeqID NO: 84950 8-35 gi|551594 194-221 100% Seq ID NO: 84950 8-35 gi|551595669-696 100% Seq ID NO: 84950 8-35 gi|18032254 34-61 100% Seq ID NO:84950 12-35 gi|18477256 279-256 100% Seq ID NO: 84950 12-35 gi|18477260325-302 100% Seq ID NO: 84950 12-35 gi|18477262 634-611 100% Seq ID NO:84950 8-29 gi|18477259 22-1 100% Seq ID NO: 84950 8-29 gi|18477261 22-1100% Seq ID NO: 84950 8-29 gi|37780968 22-1 100% Seq ID NO: 84996246-365 gi|32324211 375-494 84% Table 2 Legend: ¹ H. glycines Clone IDNo as set forth in Sequence Listing feature fields; searching the H.glycines sequence identifier in column 1 identifies the correspondingSEQ ID NO for that sequence ²nucleotide position in SEQ ID NOcorresponding to Clone ID No in column 1 that matches with position ofsequence of GeneID in adjacent cell on same row of table 2 ³GeneIDnumber of corresponding matching sequence hit from public database thatmatches with position of Clone ID No from column 1; derivative organisminformation is associated with the GeneID No. ⁴nucleotide position inGeneID that matches with nucleotides specified on same row correspondingto sequence of Clone ID SEQ ID NO ⁵percent identity between the twosequences in Clone ID and GeneID

Surprisingly the inventors have also discovered that somepolynucleotides of the present invention exhibit homology with variousinsect pests of plants and animals, as illustrated in Table 3. Thisprovides an opportunity to express in plant cells polynucleotidesexemplified in Table 3 as double stranded RNA sequences, providingcontrol of many of these insect pests of plants and animals. Mosquitoes,for example, are well known as vectors for spreading malaria, yellowfever, encephalitis, filarial parasites and other serious diseases. Malemosquitoes feed exclusively on plant nectar and on plant cell exudates,and female mosquitoes feed on plants when a blood meal is not available.The present invention therefore provides a means for applying theexemplary sequences as dsRNA molecules expressed in plant cells as ameans for controlling nematode and insect pests by expression ofsequences identified as representative of common sequences between thetwo species.

TABLE 3 SCN vcDNA Sequences and Insect Nucleotide Sequence HomologousSEQ ID NO¹ Position² Gene ID³ Position⁴ % identity⁵ Genus species⁶ SeqID NO: 68821 138-159 CRA|agCT42044 945-966 100% Anopheles gambiae Seq IDNO: 79019 537-557 CRA|agCT43147 1437-1457 100% Anopheles gambiae Seq IDNO: 47443 622-642 CRA|agCT43876 833-853 100% Anopheles gambiae Seq IDNO: 73243 46-69 CRA|agCT44110 215-238 100% Anopheles gambiae Seq ID NO:54820 116-136 CRA|agCT44330 197-177 100% Anopheles gambiae Seq ID NO:65924 13-33 CRA|agCT44378 3440-3460 100% Anopheles gambiae Seq ID NO:53889 589-609 CRA|agCT44871 13437-13457 100% Anopheles gambiae Seq IDNO: 66729 38-59 CRA|agCT45079 733-754 100% Anopheles gambiae Seq ID NO:60455 259-279 CRA|agCT45391 1624-1644 100% Anopheles gambiae Seq ID NO:69523 121-145 CRA|agCT45432 192-216  96% Anopheles gambiae Seq ID NO:80670 47-67 CRA|agCT46846 1180-1160 100% Anopheles gambiae Seq ID NO:81791 47-69 CRA|agCT46968 1752-1730 100% Anopheles gambiae Seq ID NO:67053 313-339 CRA|agCT47874 1073-1098  96% Anopheles gambiae Seq ID NO:59568 697-718 CRA|agCT48203 192-171 100% Anopheles gambiae Seq ID NO:72334 238-258 CRA|agCT49436 729-749 100% Anopheles gambiae Seq ID NO:63951 103-123 CRA|agCT49483 652-672 100% Anopheles gambiae Seq ID NO:62555 130-150 CRA|agCT51096 2549-2569 100% Anopheles gambiae Seq ID NO:46459 342-364 CRA|agCT51427 215-237 100% Anopheles gambiae Seq ID NO:66786 307-327 CRA|agCT52597 1900-1920 100% Anopheles gambiae Seq ID NO:68784  4-24 CRA|agCT55082 20600-20620 100% Anopheles gambiae Seq ID NO:53634 761-804 CRA|agCT55207 620-577  89% Anopheles gambiae Seq ID NO:53634 754-774 CRA|agCT55207 597-577 100% Anopheles gambiae Seq ID NO:53635  58-101 CRA|agCT55207 577-620  89% Anopheles gambiae Seq ID NO:53635  88-108 CRA|agCT55207 577-597 100% Anopheles gambiae Seq ID NO:73360 270-291 CRA|agCT55621 744-723 100% Anopheles gambiae Seq ID NO:82401 107-127 CRA|agCT55677 848-868 100% Anopheles gambiae Seq ID NO:55012 650-670 EBI|221 281-261 100% Anopheles gambiae Seq ID NO: 78551 90-110 EBI|2300 624-644 100% Anopheles gambiae Seq ID NO: 68299  84-105EBI|2307 2080-2101 100% Anopheles gambiae Seq ID NO: 47206  84-105EBI|2307 2080-2101 100% Anopheles gambiae Seq ID NO: 68556 241-262EBI|4053 3009-3030 100% Anopheles gambiae Seq ID NO: 70190 174-194EBI|4283 5460-5440 100% Anopheles gambiae Seq ID NO: 53886 405-425EBI|5326 2105-2085 100% Anopheles gambiae Seq ID NO: 70190 243-266EBI|8982 1366-1389 100% Anopheles gambiae Seq ID NO: 51267 164-184EBI|9090 121-141 100% Anopheles gambiae Seq ID NO: 55175 12-41gi|11119314 36-7  100% Andrya cuniculi Seq ID NO: 66692 12-41gi|11119314 36-7  100% Andrya cuniculi Seq ID NO: 55175 12-41gi|11119315 36-7  100% Paranoplocephala sp. Seq ID NO: 66692 12-41gi|11119315 36-7  100% Paranoplocephala sp. Seq ID NO: 55175 12-41gi|11119317 36-7  100% Paranoplocephala arctica Seq ID NO: 66692 12-41gi|11119317 36-7  100% Paranoplocephala arctica Seq ID NO: 55175 12-41gi|11119319 36-7  100% Paranoplocephala serrata Seq ID NO: 66692 12-41gi|11119319 36-7  100% Paranoplocephala serrata Seq ID NO: 47304 674-696gi|23186984 477-499 100% Echinococcus granulosus Seq ID NO: 77038 16-43gi|2463301 112-85   96% Neogryporhynchus cheilancristrotus Seq ID NO:59901 75-96 gi|31365321 176-155 100% Toxoptera citricida Seq ID NO:51042 482-505 gi|31365444 320-297 100% Toxoptera citricida Seq ID NO:77354  97-117 gi|31365580 644-664 100% Toxoptera citricida Seq ID NO:82622 494-514 gi|37804570 57-77 100% Rhopalosiphum padi Seq ID NO: 6686173-95 gi|46996593 400-378 100% Acyrthosiphon pisum Seq ID NO: 6400776-96 gi|46996721 468-448 100% Acyrthosiphon pisum Seq ID NO: 6008170-96 gi|46998065 535-562  96% Acyrthosiphon pisum Seq ID NO: 54610316-337 gi|46998427 300-279 100% Acyrthosiphon pisum Seq ID NO: 47304674-695 gi|47163116 387-408 100% Echinococcus granulosus Seq ID NO:62977 216-237 gi|47514887 260-281 100% Acyrthosiphon pisum Seq ID NO:58768 293-314 gi|47517134 76-55 100% Acyrthosiphon pisum Seq ID NO:63826 72-92 gi|47522032 581-561 100% Acyrthosiphon pisum Seq ID NO:82094 183-203 gi|47533062 209-189 100% Acyrthosiphon pisum Seq ID NO:72433 558-578 gi|47536611 364-384 100% Acyrthosiphon pisum Seq ID NO:57774 203-223 gi|47536768 381-401 100% Acyrthosiphon pisum Seq ID NO:55175 12-41 gi|54306309 36-7  100% Anoplocephaloides cf. Seq ID NO:66692 12-41 gi|54306309 36-7  100% Anoplocephaloides cf. Seq ID NO:55175 12-41 gi|54306311 36-7  100% Anoplocephaloides kontrimavichusi SeqID NO: 66692 12-41 gi|54306311 36-7  100% Anoplocephaloideskontrimavichusi Seq ID NO: 55175 12-41 gi|54306312 36-7  100%Anoplocephaloides lemmi Seq ID NO: 66692 12-41 gi|54306312 36-7  100%Anoplocephaloides lemmi Seq ID NO: 55175 12-41 gi|54306316 36-7  100%Andrya rhopalocephala Seq ID NO: 66692 12-41 gi|54306316 36-7  100%Andrya rhopalocephala Seq ID NO: 55175 12-41 gi|54306318 36-7  100%Diandrya composita Seq ID NO: 66692 12-41 gi|54306318 36-7  100%Diandrya composita Seq ID NO: 55175 12-41 gi|54306319 36-7  100%Mosgovoyla pectinata Seq ID NO: 66692 12-41 gi|54306319 36-7  100%Mosgovoyla pectinata Seq ID NO: 55175 12-41 gi|54306320 36-7  100%Moniezia sp. Seq ID NO: 66692 12-41 gi|54306320 36-7  100% Moniezia sp.Seq ID NO: 55175 12-41 gi|54306321 36-7  100% Monoecocestus americanusSeq ID NO: 66692 12-41 gi|54306321 36-7  100% Monoecocestus americanusSeq ID NO: 55175 12-41 gi|54306322 36-7  100% Paranoplocephalablanchardi Seq ID NO: 66692 12-41 gi|54306322 36-7  100%Paranoplocephala blanchardi Seq ID NO: 55175 12-41 gi|54306323 36-7 100% Paranoplocephala etholeni Seq ID NO: 66692 12-41 gi|54306323 36-7 100% Paranoplocephala etholeni Seq ID NO: 55175 12-41 gi|54306324 36-7 100% Paranoplocephala fellmani Seq ID NO: 66692 12-41 gi|54306324 36-7 100% Paranoplocephala fellmani Seq ID NO: 55175 12-41 gi|54306325 36-7 100% Paranoplocephala gracilis Seq ID NO: 66692 12-41 gi|54306325 36-7 100% Paranoplocephala gracilis Seq ID NO: 55175 12-41 gi|54306326 36-7 100% Paranoplocephala longivaginata Seq ID NO: 66692 12-41 gi|5430632636-7  100% Paranoplocephala longivaginata Seq ID NO: 55175 12-41gi|54306327 36-7  100% Paranoplocephala macrocephala Seq ID NO: 6669212-41 gi|54306327 36-7  100% Paranoplocephala macrocephala Seq ID NO:55175 12-41 gi|54306328 36-7  100% Paranoplocephala cf. Seq ID NO: 6669212-41 gi|54306328 36-7  100% Paranoplocephala cf. Seq ID NO: 55175 12-41gi|54306329 36-7  100% Paranoplocephala kalelai Seq ID NO: 66692 12-41gi|54306329 36-7  100% Paranoplocephala kalelai Seq ID NO: 55175 12-41gi|54306331 36-7  100% Paranoplocephala primordialis Seq ID NO: 6669212-41 gi|54306331 36-7  100% Paranoplocephala primordialis Seq ID NO:55175 12-41 gi|54306332 36-7  100% Schizorchis sp. Seq ID NO: 6669212-41 gi|54306332 36-7  100% Schizorchis sp. Seq ID NO: 60083 406-426gi|55794409 449-429 100% Acyrthosiphon pisum Seq ID NO: 62688  3-23gi|55802365 769-789 100% Acyrthosiphon pisum Seq ID NO: 82614 269-290gi|55803329 472-493 100% Acyrthosiphon pisum Seq ID NO: 54865 19-41gi|55806106 709-731 100% Acyrthosiphon pisum Seq ID NO: 65500 126-146gi|55810448 308-328 100% Acyrthosiphon pisum Seq ID NO: 80198 225-245gi|55810583 245-265 100% Acyrthosiphon pisum Seq ID NO: 51184 74-94gi|55814836 337-317 100% Acyrthosiphon pisum Seq ID NO: 52762  1-36gi|6467344 700-665  91% Duplicibothrium paulum Table 3 Legend: ¹H.glycines Clone ID No as set forth in Sequence Listing feature fields;searching the H. glycines sequence identifier in column 1 identifies thecorresponding SEQ ID NO for that sequence ²nucleotide position in SEQ IDNO corresponding to Clone ID No in column 1 that matches with positionof sequence of Gene ID in column 3 on same row ³Gene ID number ofcorresponding matching sequence hit from public database that matcheswith position of Clone ID No from column 1; information in table issorted by column 3 ⁴Gene ID nucleotide position in column 3 that matcheswith nucleotides specified on same row corresponding to sequence of SCNClone ID ⁵percent identity between the two sequences in Clone ID andGene ID (comparison of identity between column 2 and column 4 sequences)⁶Genus and species of organism corresponding to gene sequence set forthin Column 3

Example 6

This example illustrates the suppression of one or more genes in asoybean cyst nematode by providing in the diet of the nematode a doublestranded RNA consisting of a nucleotide sequence that is complementaryto the messenger RNA sequence expressed from the one or more soybeancyst nematode genes.

Soybean cyst nematode J2 larvae are treated with a double stranded RNAderived from a nucleotide sequence selected from the group consisting ofSEQ ID NO:1-SEQ ID NO:97729 in a soaking assay as described in WO03052110. Briefly, freshly hatched nematode larvae are treated in asoaking buffer (10 mM octopamine in M9 salts, 1 mg/ml FITC in DMF) withor without 2 microgram/microliter of dsRNA for four hours at roomtemperature. Larvae ingesting the solution are fluorescent. Thefluorescent larvae are separated from non-fluorescent larvae, theninoculated into soil containing a germinated soybean seedling. Thenumber of cysts on each plant are counted about 35 days afterinoculation. The tested dsRNA molecules that demonstrate significantreduction in the number of cysts counted are then made into plantexpression cassettes contained in DNA constructs designed for plant celltransformation. DNA constructs generally comprise constitutive promotersthat cause transcription of a linked DNA that transcribes a dsRNA.Promoters that may exhibit enhanced expression in root tissue may beparticularly useful for expressing dsRNA effective against soybean cystnematodes.

These DNA constructs are transformed into soybean plant cells and thecells regenerated into plants. The plants are tested either as Ro plantsfor nematode resistance or seed is collected and the R1 seed isgerminated and the R1 plant roots tested for nematode resistance.Resistance is demonstrated if the transgenic plants have a significantreduction in cyst number or cyst development.

Example 7

This example describes DNA constructs and the expression of a chimericRNA molecule of the present invention in a transgenic soybean plantcell. The DNA constructs described herein comprise a promoter thatcauses transcription of an operably linked DNA into an RNA in a soybeancell, the DNA and the transcribed RNA of one or more segments exhibitinghomology or complementarity to a soybean cyst nematode contiguous atleast about 21-mer nucleotide sequence (DNA or RNA). Exemplary soybeancyst nematode DNA segments were previously described in Table 1 and arefurther identified in the Sequence Listing as SEQ ID NO:1-SEQ IDNO:45568. When expressed in a plant cell, the DNA construct provides anRNA transcript molecule comprising a self-complimentary segment, aportion of which folds into a double stranded RNA (dsRNA). Detection ofthe RNA transcript expressed in a cell or tissue of a transgenic plantis diagnostic for the DNA construct(s) that comprises a region of asoybean cyst nematode DNA molecule, and demonstrates that the DNAsegment from which the dsRNA molecule is derived istranscribed/expressed in the transgenic soybean cells. Therefore, thetranscribed RNA becomes available in the diet of the nematode as itfeeds on a plant root cell. The RNA comprises a region that is doublestranded and is complementary to a naturally occurring polynucleic acidmolecule contained in a soybean cyst nematode cell, and when ingested bythe nematode results in suppression of the normal level of the naturallyoccurring molecule.

Exemplary DNA constructs of the present invention have variousregulatory elements that provide transcription or enhance expression orstability of an RNA molecule transcribed from a transgene in a plantcell. For example, a promoter element of a DNA construct of the presentinvention provides expression of an RNA transcript in a plant cell.Promoters, which can cause the transcription of a linked heterologousDNA are generally known in the art, for example, DNA plant viruspromoters (P-CaMV35S, U.S. Pat. Nos. 5,352,605 and 5,196,525, comprisinga duplicated enhancer region herein referred to as P-e35S; P-FMV35S,U.S. Pat. Nos. 5,378,619 and 5,018,100, herein incorporated by referencein their entirety), and various plant derived promoters, for example,plant actin promoters (P-Os.Act, U.S. Pat. Nos. 5,641,876 and 6,429,357,herein incorporated by reference in their entirety), and chimericpromoters, for example, P-FMV-Elf1α (U.S. Pat. No. 6,660,911 and otherchimeric promoters disclosed therein, herein incorporated by referencein their entirety). Additionally, promoters that provide enhancedexpression in root cells relative to other plant cells, may be testedand selected to express the RNA molecules of the present invention. TheDNA constructs described in this example utilize the P-e35S and P-FMVpromoters to drive the transcription of a DNA and expression of a dsRNAthat exhibits homology to a portion of a soybean cyst nematodenucleotide sequence. For example, a nucleotide sequence was assembledconsisting of two segments, the forward and reverse nucleotide sequenceof SEQ ID NO:22219 from nucleotide position 552-699, linked by anamorphous 20-200 nucleotide segment that did not exhibit any knowncomplementarity to the SCN genome sequences. Bioinformatics analysisindicates that the nucleotide sequence of SEQ ID NO:22219 corresponds toan open reading frame encoding an SCN specific proteasome A-type subunitpeptide referred to herein as Pas-4. This chimeric sequence wasincorporated into plant expression vectors for use in testing dsRNAmediated suppression of the pas-4 target gene. The DNA constructs 5749(P-FMV/Pas-4-dsRNA/E6 3′ UTR) was thus assembled and comprises thenecessary transfer molecules and regulatory molecules to provideintegration into the genome of plant cells and expression of the dsRNAmolecule therein.

The DNA constructs comprise a T-DNA region that is transferred into thegenome of a plant cell by an Agrobacterium- or Rhizobium-mediated plantcell transformation method, and additional regulatory elements, forexample, a 3′ untranslated region (3′ UTR) of the SIE6-3B gene fromGossypium barbadense, herein referred to as E6 3′ UTR (John Plant MolBiol 30:297-306, 1996, NCBI accession U30508, nucleotide position fromabout 992-1304). The DNA construct 5749 (P-FMV/Pas-4-dsRNA/E6 3′ UTR)was transferred into Agrobacterium rhizogenes strain.

A transgenic root culture of soybean has been shown to support soybeancyst nematode infection and is useful for the expression of transgenes(Narayanan, et al., Crop Sci. 39:1680-1686, 1999 and Cho et al., Planta(2000) 210:195-204). Agrobacterium rhizogenes transformed to contain thedescribed DNA construct 5749 was used to transformed soybean cells andcreate independent transgenic root cultures, referred to herein asevents 5749-1, etc. Tissues from the transgenic root cultures wereassayed for expression of the chimeric SCN gene suppression RNAmolecule. Transgenic root tissues were selected using appropriateselection pressures. Transgenic root tissues from each event werescreened for the presence of the fluorescence marker expression that wasintegrated into and adjacent to the dsRNA expression construct. Thetransgenic root tissues were also screened for the presence of siRNAsegments produced from exposure to the root tissue cells' endogenousDICER molecules. siRNA segments were screened for identity to segmentsof the corresponding dsRNA coding sequences expressed from the plasmidconstruct expression cassettes. Methods for detecting the presence of anexpressed RNA in a cell are known in the art. For example, in thisexample, the presence of the 3′ UTR was detected using primers thatfunctioned to amplify the UTR sequence from the expressed RNA sequence.A TAQMAN method was then used along with a 3′ UTR specific fluorescenceprobe to detect the UTR as well as provide information on the relativelevel of expression from the construct. The data is shown in Table 4.

TABLE 4 Levels of Pas-4-dsRNA in Transgenic Soybean Root Cells. EventAve fluorescense St Dev siRNA Northern Vector control 0.00 0.00 ND5749-1 2.30 1.05 + 5749-3 3.83 1.39 + 5749-4 5.47 2.44 + 5749-5 3.730.64 + 5749-8 0.45 0.14 ND 5749-10 0.16 0.02 ND 5749-11 0.24 0.04 +5749-12 0.33 0.17 ND 5749-A 3.14 0.70 + 5749-B 2.91 0.27 + ND — notdetected

The data in Table 4 indicates that ten events comprising the Pas-4-dsRNAcontained detectable levels of the RNA molecule. Northern blot analysisof these events showed detectable levels of siRNA that specificallyhybridizes to DNA probes made from a homologous fragment of the Pas-4coding region. These results demonstrate that soybean cells can betransformed with DNA constructs for expression of dsRNA moleculesspecific for gene suppression of SCN target genes, and that thetransformed plant cells recognize the RNA molecules and dice them intodetectable siRNA molecules that may be useful for specific genesuppression of the target gene(s) when provided in the diet of soybeancyst nematodes.

All patent publications cited in this specification are incorporatedherein by reference to the same extent as if each individual publicationor patent application was specifically and individually indicated to beincorporated by reference.

The Sequence Listing is submitted along with this specification on twocompact discs. One disc is labeled ‘Sequence Listing’ according to 37CFR §1.52(e)(4), and the other disc is labeled ‘CRF’ (computer readableform) according to 37 CFR §1.821(e). Each disc contains a single 271,645kilo-byte text file labeled ‘SCN_seqListing.txt’, created on Feb. 22,2005, in IBM-PC format and is compatible with IBM-PC, MS-Windows,Macintosh, and UNIX operating systems. The sequence listing informationrecorded in computer readable form is identical to the written compactdisc sequence listing. The Sequence Listing text file is incorporatedherein by reference.

1. An isolated Heterodera glycines nucleotide sequence selected from the group consisting of SEQ ID NO:22219, SEQ ID NO:45701, and SEQ ID NO:52890.
 2. An isolated nucleotide sequence selected from the group consisting of SEQ ID NO:45701 and SEQ ID NO:52890 encoding a polypeptide, wherein said polypeptide is isolated from Heterodera species.
 3. A method for suppressing a peptide coding sequence in a Heterodera glycines pest, comprising selecting a target polynucleotide sequence of a Heterodera glycines pest genome comprising at least from about 24 contiguous nucleotides, expressing the polynucleotide sequence as an RNA sequence that forms a double stranded RNA structure in a plant cell, and providing said plant cell in the diet of said Heterodera glycines pest, wherein uptake of the contents of said cell by said pest results in the morbidity and/or mortality of said pest, and wherein said target polynucleotide sequence is selected from the group consisting of SEQ ID NO:22219, SEQ ID NO:45701, and SEQ ID NO:52890.
 4. The method of claim 3, wherein said RNA sequence comprises contiguous nucleotides from at least two or more target polynucleotide sequences from two or more different coding sequences of a Heterodera glycines pest.
 5. The method of claim 4, wherein each of said two or more different coding sequences are isolated from different Heterodera glycines cDNA sequences.
 6. A method for controlling Heterodera glycines infection in a soybean plant comprising a) transforming a soybean plant cell with a DNA construct that expresses a dsRNA molecule in said soybean cell; and b) regenerating said transformed soybean plant cell into a fertile transgenic soybean plant that exhibits resistance to infection by Heterodera glycines; wherein said DNA construct comprises a polynucleotide sequence comprising at least about 24 contiguous nucleotides selected from the group consisting of SEQ ID NO:22219, SEQ ID NO:45701, and SEQ ID NO:52890.
 7. The method of claim 6, wherein said DNA construct comprises at least one selectable marker providing herbicide tolerance to the transgenic soybean plant.
 8. The method of claim 7, wherein said transgenic soybean plant exhibits tolerance to the herbicide glyphosate.
 9. A plasmid library comprising SCN genome sequences wherein said library is ATCC Patent Deposit No. PTA-6583 deposited on Feb. 15,
 2005. 10. An isolated and purified plasmid selected from the library of claim
 9. 11. A nucleotide sequence selected from the group consisting of SEQ ID NO:1-SEQ ID NO:45568, and the complement thereof, isolated from a plasmid in the library of claim
 9. 12. A vcDNA sequence selected from the group consisting of SEQ ID NO:45569-SEQ ID NO:97729, wherein said vcDNA sequence is isolated from a plasmid in the library of claim
 9. 13. An amino acid sequence selected from the group consisting of SEQ ID NO:119146-SEQ ID NO:121220 encoded by the vcDNA of claim
 12. 14. A transgenic soybean plant comprising the double stranded structure of claim
 3. 15. A seed produced from the plant of claim 14, wherein said seed comprises said structure.
 16. A plant grown from the seed of claim
 15. 